Fig. 1



AGATAGAGAATTTTCTTATTTAGACTTTGTGTCTACTCCTCTCAACTAAACGAAATTTTTCTAGTGCTGTCATTTGTTATGGCAGTCCTAGT

92

TCTATCTCTTAAAAGAATAAATCTGAAACACAGATGAGGAGAGTTGATTTGCTTTAAAAAGATCACGACAGTAAACAATACCGTCAGGATCA 



5'UTR

GTAATTGAAATTTCGTCAAGTTTGTAAACTGGTTAGGCAAGTGTTGTATTTTCTGTGTTTAAGCACTGGTGGTTCTGTCCACTAGTGCACAC 

184

CATTAACTTTAAAGCAGTTCAAACATTTGACCAATCCGTTCACAACATAAAAGACACAAATTCGTGACCACCAAGACAGGTGATCACGTGTG 



5'UTR



ATTGATACTTAAGTGGTGTTCTGTCACTGCTTATTGTGGAAGCAACGTTCTGTCGTTGTGGAAACCAATAACTGCTAACCATGTTTTACAAT

276

TAACTATGAATTCACCACAAGACAGTGACGAATAACACCTTCGTTGCAAGACAGCAACACCTTTGGTTATTGACGATTGGTACAAAATGTTA 



MFYN
5'UTR; Replicase 1a



CAAGTGACACTTGCTGTTGCAAGTGATTCGGAAATTTCAGGTTTTGGTTTTGCCATTCCTTCTGTAGCCGTTCGCGCTTATAGCGAAGCCGC 

368

GTTCACTGTGAACGACAACGTTGACTAAGCCTTTAAAGTCCAAAACCAAAACGGTAAGGAAGACATCGGCAAGCGCGAATATCGCTTCGGCG 

QVTLAVASDSEISGFGFAIPSVAVRAYSEAA
Replicase 1a



TGCACAAGGTTTTCAGGCATGCCGCTTTGTTGCTTTTGGCTTACAGGATTGTGTAACCGGTATTAATGATGACGATTATGTCATTGCATTGA 

460

ACGTGTTCCAAAAGTCCGTACGGCGAAACAACGAAAACCGAATGTCCTAACACATTGGCCATAATTACTACTGCTAATACAGTAACGTAACT 

AQGFQACRFVAFGLQDCVTGINDDDYVIAL
Replicase 1a 



CTGGTACTAATCAGCTTTGTGCCAAAATTTTACTTTTTTCTGATAGACCTCTTAATTTGCGAGGTTGGCTCATTTTTTCTAACAGCAATTAT 

552

GACCATGATTAGTCGAAACACGGTTTTAAAATGAAAAAAGACTATCTGGAGAATTAAACGCTCCAACCGAGTAAAAAAGATTGTCGTTAATA 

TG T N. Q I C A K I LLFSDRPLNLRG WL I FSNSNY 
Replicase 1a



GTTCTTCAGGACTTTGATGTTGTTTTTGGCCATGGTGCAGGAAGTGTGGTTTTTGTGGATAAGTATATGTGTGGTTTTGATGGTAAACCTGT 

644

CAAGAAGTCCTGAAACTACAACAAAAACCGGTACCACGTCCTTCACACCAAAAACACCTATTCATATACACACCAAAACTACCATTTGGACA 

VLQOFOVVFGHGAGSVVFVDKYMCGFDGKPV 
Replicase 1a



GTTACCTAAAAACATGTGGGAATTTAGAGATTACTTTAATGATAATACTGATAGTATTGTTATTGGTGGTGTCACTTATCAATTAGCATGGG 

736

CAATGGATTTTTGTACACCCTTAAATCTCTAATGAAATTACTATTATGACTATCATAACAATAACCACCACAGTGAATAGTTAATCGTACCC 


LPKNHWEFROYFNONTDSIVI GGyTYOLAW 
Replicase 1a



ATGTTATACGTAAAGACCTTTCTTATGAACAGCAAAATGTTTTAGCTATTGAGAGCATTCATTATCTTGGCACTACAGGTCATACTTTGAAG 

828

TACAATATGCATTTCTGGAAAGAATACTTGTCGTTTTACAAAATCGATAACTCTCGTAAGTAATAGAACCGTGATGTCCAGTATGAAACTTC 

DV I.RKDLSYEQONVLA ! ES l HYLGTTGH TLK 
Replicase 1a
TCTGGTTGCAAACTCATTAATGCCAAGCCGCCTAAATATTCTTCTAAGGTTGTTTTGAGTGGTGAATGGAATGCTGTGTATAAGGCGTTTGG

920

AGACCAACGTTTGAGTAATTACGGTTCGGCGGATTTATAAGAAGATTCCAACAAAACTCACCACTTACCTTACGACACATATTCCGCAAACC 

SGCKL INAKPPKYSSKVVLSGEWNAVYKAFG 
Replicase 1a



TTCACCATTTATTACAAATGGTATATCATTGCTAGATATAATTGTTAAACCAGTTTTCTTTAATGCTTTTGTTAAATGCAATTGTGGTTCTG 

1012

AAGTGGTAAATAATGTTTACCATATAGTAACGATCTATATTAACAATTTGGTCAAAAGAAATTACGAAAACAATTTACGTTAACACCAAGAC 

SPFITNG1SLLD1 I. VKPVFFNAFVKCNCGS 
Replicase 1a



AGAATTGGAGTGTTGGTGCATGGGATGGTTATCTATCTTCTTGTTGTGGCACACCTGCTAAGAAACTTTGTGTTGTTCCTGGTAATGTTGTT

1104

TCTTAACCTCACAACCACGTACCCTACCAATAGATAGAAGAACAACACCGTGTGGACGATTCTTTGAAACACAACAAGGACCATTACAACAA 

ENWSVGAWDGYLSSCCG7PAKKLCVVPGNVV 
Replicase 1a



CCTGGTGATGTGATCATCACCTCAACTGATGCTGGTTGTGGTGTTAAATACTATGCTGGCTTAGTTGTTAAACATATTACTAACATTACTGG

1196

GGACCACTACACTAGTAGTGGAGTTGACTACGACCAACACCACAATTTATGATACGACCGAATCAACAATTTGTATAATGATTGTAATGACC 

PGOVI ITS TDAGCG'VKYYAGLVVKHITNITG 
Replicase 1a



TGTGTCTTTATGGCGTGTTACAGCTGTTCATTCTGATGGAATGTTTGTGGCAACATCTTCTTATGATGCACTTTTGCATAGAAATTCATTAG 

1288

ACACAGAAATACCGCACAATGTCGACAAGTAAGACTACCTTACAAACACCGTTGTAGAAGAATACTACGTGAAAACGTATCTTTAAGTAATC

VSLWRVTAVHSOGMF.VATSS_YDALLH.RNSL 
Replicase 1a



ACCCTTTTTGCTTTGATGTTAACACTTTACTTTCTAATCAATTACGTCTAGCTTTTCTTGGTGCTTCTGTTACAGAAGATGTTAAATTTGCT 

1380

TGGGAAAAACGAAACTACAATTGTGAAATGAAAGATTAGTTAATGCAGATCGAAAAGAACCACGAAGACAATGTCTTCTACAATTTAAACGA 

OPFCF DVNTLLSNGLRLAFLGASVTEOVKFA 
Replicase 1a



GCTAGCACTGGTGTTATTGACATTAGTGCTGGTATGTTTGGTCTTTACGATGACATATTGACAAACAATAAACCTTGGTTTGTACGCAAAGC 

1472

CGATCGTGACCACAATAACTGTAATCACGACCATACAAACCAGAAATGCTACTGTATAACTGTTTGTTATTTGGAACCAAACATGCGTTTCG 

ASTGV I 0 I SAGMFGLYOO JLTNNKPWFVRKA 
Replicase 1a



TTCTGGGCTTTTTGATGCAATCTGGGATGCTTTTGTTGCCGCTATTAAGCTTGTGCCAACTACTACTGGTGGTTTGGTTAGGTTTGTTAAGT 

1564

AAGACCCGAAAAACTACGTTAGACCCTACGAAAACAACGGCGATAATTCGAACACGGTTGATGATGACCACCAAACCAATCCAAACAATTCA 

SGLFDA IWOAFVAA IKLVPTTTGGLVRFVK 
Replicase 1a



CTATCGCTTCAACTGTTTTAACTGTTTCTAATGGTGTTATTATTATGTGTGCAGATGTTCCAGATGCTTTTCAACCAGTTTACCGCACATTT 

1656

GATAGCGAAGTTGACAAAATTGACAAAGATTACCACAATAATAATACACACGTCTACAAGGTCTACGAAAAGTTGGTCAAATGGCGTGTAAA 

SIASTVLTVSNGVI lMCAOVPOAFOPVYRTF 
Replicase 1a



SUBSTITUTE SHEET (RULE 26) 



WO 2005/049814 



3/87 



PCT/NL2004/000805 



ACACAAGCTATTTGTGCTGCATTTGATTTTTCTTTAGATGTATTTAAAATTGGTGATGTTAAATTTAAACGACTTGGTGATTATGTTCTTAC

1748

TGTGTTCGATAAACACGACGTAAACTAAAAAGAAATCTACATAAATTTTAACCACTACAATTTAAATTTGCTGAACCACTAATACAAGAATG 

TGA I C A A TOF SLOVFKIGDVK F K RLGOY VL T 
Replicase 1a



TGAAAATGCTCTTGTTCGTTTGACTACTGAAGTTGTTCGTGGTGTTCGTGATGCTCGCATAAAGAAAGCCATGTTTACTAAAGTAGTTGTAG 

1840

ACTTTTACGAGAACAAGCAAACTGATGACTTCAACAAGCACCACAAGCACTACGAGCGTATTTCTTTCGGTACAAATGATTTCATCAACATC 

ENALVRITTEVVRGVROAR I K&AMFTKVVV 
Replicase 1a 



GTCCTACAACTGAAGTTAAGTTTTCTGTTATTGAACTTGCCACTGTTAATTTGCGTCTTGTTGATTGTGCACCTGTAGTTTGCCCTAAAGGT

1932

CAGGATGTTGACTTCAATTCAAAAGACAATAACTTGAACGGTGACAATTAAACGCAGAACAACTAACACGTGGACATCAAACGGGATTTCCA 

GPTTEVKFSV I ELATVNLRLVOCAPVVC P K G 
Replicase 1a



AAAATTGTTGTTATTGCTGGACAAGCTTTTTTCTATAGTGGTGGTTTTTATCGTTTTATGGTTGATTCTACAACTGTATTAAATGACCCTGT 

2024

TTTTAACAACAATAACGACCTGTTCGAAAAAAGATATCACCACCAAAAATAGCAAAATACCAACTAAGATGTTGACATAATTTACTGGGACA 

K IVV 1 AGQAFFYSGGFYRFMVDSTTVLNOPV 
Replicase 1a



TTTTACTGGTGAGTTATTTTATACTATTAAGTTTAGTGGTTTTAAGCTTGATGGTTTTAACCATCAGTTTGTTAATGCTAGTTCTGCTACAG 

2116

AAAATGACCACTCAATAAAATATGATAATTCAAATCACCAAAATTCGAACTACCAAAATTGGTAGTCAAACAATTACGATCAAGACGATGTC 

FTGELFYT I KFSGFK.LDGFNH". QFVNASSAT 
Replicase 1a



ATGCCATTATTGCTGTTGAGCTGTTGTTATCGGATTTTAAAACTGCAGTTTTTGTGTACACATGTGTGGTTGATGGTTGTAGTGTCATTGTT 

2208

TACGGTAATAACGACAACTCGACAACAATAGCCTAAAATTTTGACGTCAAAAACACATGTGTACACACCAACTACCAACATCACAGTAACAA 

OAI IAVELLISDFKTAVFVYTCVVOGCSVIV 
Replicase 1a



AGACGTGATGCTACATTCGCCACACATGTGTGTTTTAAGGACTGTTATAGTATTTGGGAGCAATTCTGCATTGATAATTGTGGTGAGCCATG 

2300

TCTGCACTACGATGTAAGCGGTGTGTACACACAAAATTCCTGACAATATCATAAACCCTCGTTAAGACGTAACTATTAACACCACTCGGTAC 

RRDATFATHVCFKDCYSIWEOFC IONCGEPW 
Replicase 1a



GTTTTTGACTGATTATAATGCTATCTTGCAGAGTAATAACCCTCAATGTGCTATTGTTCAAGCATCGGAGTCTAAAGTTTTGCTTGAGAGGT 

2392

CAAAAACTGACTAATATTACGATAGAACGTCTCATTATTGGGAGTTACACGATAACAAGTTCGTAGCCTCAGATTTCAAAACGAACTCTCCA 



FLTDYNAILQSNNPQCAIVQASESKVLLER
Replicase 1a



TTTTACCTAAGTGTCCTGAAATACTGTTGAGTATTGATGATGGCCATTTATGGAATCTTTTTGTTGAAAAGTTTAATTTTGTTACAGATTGG 

2484

AAAATGGATTCACAGGACTTTATGACAACTCATAACTACTACCGGTAAATACCTTAGAAAAACAACTTTTCAAATTAAAACAATGTCTAACC 

FLPKCPEILLSIDDGHLWNLFVEKFNFVTDW
Replicase 1a






TTAAAAACTCTTAAGCTTACACTTACTTCTAATGGTCTTTTAGGTAATTGTGCCAAACGTTTTAGACGTGTTTTGGTAAAATTGCTTGATGT

2576

AATTTTTGAGAATTCGAATGTGAATGAAGATTACCAGAAAATCCATTAACACGGTTTGCAAAATCTGCACAAAAGCATTTTAACGAACTACA 

LKTLKLTLTS N GllGNCAKRFRRVLVKLLDV 
Replicase 1a



CTATAATGGTTTTCTTGAAACTGTCTGTAGTGTCGTACACACTGCTGGTGTTTGCATTAAATATTATGCTGTTAATGTTCCATATGTAGTTA 

2668

GATATTACCAAAAGAACTTTGACAGACATCACAGCATGTGTGACGACCACAAACGTAATTTATAATACGACAATTACAAGGTATACATCAAT 

YN.GFLETVCSVVHT AGVC 1KYY V AVNVPYVV 

Replicase 1a



TTAGTGGTTTTGTAAGTCGTGTAATTCGTAGAGAAAGGTGTGACGTGACTTTTCCTTGTGTTAGTTGTGTCACTTTTTTCTATGAATTTTTA 

2760

AATCACCAAAACATTCAGCACATTAAGCATCTCTTTCCACACTGCACTGAAAAGGAACACAATCAACACAGTGAAAAAAGATACTTAAAAAT 

I SGFVSRV I RRERCDVTFPCVS CVTFFYEFL 
Replicase 1a



GACACGTGTTTTGGTGTTAGTAAACCTAATGCCATTGATGTTGAACATTTAGAGCTTAAAGAAACTGTTTTTGTTGAACCTAAGGATGGTGG 

2852

CTGTGCACAAAACCACAATCATTTGGATTACGGTAACTACAACTTGTAAATCTCGAATTTCTTTGACAAAAACAACTTGGATTCCTACCACC 

OTCFGVSKPNA I DVEHLEIKET .VFVEPK OGG 
Replicase 1a



TCAATTTTTTGTTTCTGATGATTATCTTTGGTATGTTGTAGATGACATTTATTATCCAGCTTCATGTAATGGTGTATTGCCAGTTGCTTTTA 

2944

AGTTAAAAAACAAAGACTACTAATAGAAACCATACAACATCTACTGTAAATAATAGGTCGAAGTACATTACCACATAACGGTCAACGAAAAT 

GFFVSOOYLWYVVDDI YYPASCNGVLPVAF 
Replicase 1a



CAAAATTGGCAGGTGGTAAAATATCTTTTTCTGATGATGTTATAGTTCATGATGTTGAACCTACCCATAAAGTCAAGCTCATATTTGAGTTT 

3036

GTTTTAACCGTCCACCATTTTATAGAAAAAGACTACTACAATATCAAGTACTACAACTTGGATGGGTATTTCAGTTCGAGTATAAACTCAAA 

TKLAGGK I SFSODVIVHOVEPTHKVKL I FEF 
Replicase 1a



GAAGATGATGTTGTTACCAGTCTTTGTAAGAAGAGTTTTGGTAAGTCTATTATTTATACAGGTGATTGGGAAGGTTTACATGAAGTTCTTAC 

3128

CTTCTACTACAACAATGGTCAGAAACATTCTTCTCAAAACCATTCAGATAATAAATATGTCCACTAACCCTTCCAAATGTACTTCAAGAATG 

EDDVVTSLCKKSFGKSI I YTGDWEGLHEVLT 

Replicase 1a



ATCTGCAATGAATGTCATTGGGCAACATATTAAGTTGCCACAATTTTATATTTATGATGAAGAGGGTGGTTATGATGTTTCTAAACCAGTTA 

3220

TAGACGTTACTTACAGTAACCCGTTGTATAATTCAACGGTGTTAAAATATAAATACTACTTCTCCCACCAATACTACAAAGATTTGGTCAAT 



SAMNV I GOH IKLPQFY IYDEEGGYDVSKPV 
Replicase 1a



TGATTTCACAATGGCCTATTAGTGATGATAGTGATGGTTGTGTTGTTGAAGCGAGCACTGATTTTCATCAATTAGAATCTGTTAGAGAAGAG 

3312

ACTAAAGTGTTACCGGATAATCACTACTATCACTACCAACACAACAACTTCGCTCGTGACTAAAAGTAGTTAATCTTAGACAATCTCTTCTC 

Ml SOWP I SODSD GCVVEASTOFHOLESVREE 
Replicase 1a



GTTGATATAATTGAACAACCTTTTGGGGAAGTTGAACATGCGCTCTCAATTAGACAACCTTTTTCTTTTTCTTTTAGAGATGAATTGGGTGT

3404

CAACTATATTAACTTGTTGGAAAACCCCTTCAACTTGTACGCGAGAGTTAATCTGTTGGAAAAAGAAAAAGAAAATCTCTACTTAACCCACA 

VDI I EQPFGEVEHALS i RQPFSFSFRDELGV 
Replicase 1a



TCGTGTTTTAGATCAATCTGATAATAATTGTTGGATTAGTACCACACTTATACAGTTGCAACTTACAAAGCTTTTGGATGATTCTATTGAGA 

3496

AGCACAAAATCTAGTTAGACTATTATTAACAACCTAATCATGGTGTGAATATGTCAACGTTGAATGTTTCGAAAACCTACTAAGATAACTCT

R VLOQSONNCWISTTL I QLQLT, KLLODS ! E 
Replicase 1a



TGCAATTGTTTAAAGTTGGTAAAGTTGATTCAATTGTTCAAAAGTGTTATGAGTTGTCTCATTTAATTAGTGGTTCACTTGGTGATAGTGGT 

3588

ACGTTAACAAATTTCAACCATTTCAACTAAGTTAACAAGTTTTCACAATACTCAACAGAGTAAATTAATCACCAAGTGAACCACTATCACCA 

MQLFKVGKVOSIVQKCYELSHLISGS LGDSG 
Replicase 1a 



AAACTTCTTAGTGAACTTCTTAAAGATAAATATACATGTTCTATAACTTTTGAGATGTCTTGTGATTGTGGTAAAAAGTTTGATGAGCAAGT 

3680

TTTGAAGAATCACTTGAAGAATTTCTATTTATATGTACAAGATATTGAAAACTCTACAGAACACTAACACCATTTTTCAAACTACTCGTTCA 

KLLSELLKOKYTCSITF-EMSCDCGKKFDEOV 
Replicase 1a



TGGTTGTTTGTTTTGGATTATGCCTTACACAAAACTTTTTCAAAAAGGTGAGTGTTGTATTTGTCATAAAATGCAGACTTATAAGCTTGTTA 

3772

ACCAACAAACAAAACCTAATACGGAATGTGTTTTGAAAAAGTTTTTCCACTCACAACATAAACAGTATTTTACGTCTGAATATTCGAACAAT 

GCLFWI MPYTKLFQKGECC I CHKMQTYKLV 
Replicase 1a



GTATGAAAGGTACTGGTGTGTTTGTACAGGATCCAGCACCTATTGACATTGATGCTTTCCCTGTTAGACCTATATGTTCATCTGTATATTTA 

3864

CATACTTTCCATGACCACACAAACATGTCCTAGGTCGTGGATAACTGTAACTACGAAAGGGACAATCTGGATATACAAGTAGACATATAAAT

SMKGTGVFVODPA.P IDI DAFPVRP 1 CSSVYL 
Replicase 1a 



GGTGTTAAGGGTTCTGGTCATTATCAAACAAATTTATACAGTTTTGACAAAGCTATTGATGGTTTTGGTGTCTTTGACATTAAAAATAGTAG 

3956

CCACAATTCCCAAGACCAGTAATAGTTTGTTTAAATATGTCAAAACTGTTTCGATAACTACCAAAACCACAGAAACTGTAATTTTTATCATC 

GVKGSGHYQTNLYSFDKA 1 OGFGVFD IKNSS 
Replicase 1a 



TGTTAATACTGTTTGTTTTGTTGATGTTGATTTTCATAGTGTAGAAATAGAAGCTGGTGAAGTTAAACCTTTTGCTGTATATAAAAATGTTA 

4048

ACAATTATGACAAACAAAACAACTACAACTAAAAGTATCACATCTTTATCTTCGACCACTTCAATTTGGAAAACGACATATATTTTTACAAT 

VNTVCFVDVDFHSVE I EAGEVKPFAVYKNV 
Replicase 1a 



AATTTTATTTAGGTGATATTTCACACCTTGTAAACTGTGTTTCTTTTGACTTTGTTGTCAATGCTGCTAATGAAAATCTCATGCATGGAGGC 

4140

TTAAAATAAATCCACTATAAAGTGTGGAACATTTGACACAAAGAAAACTGAAACAACAGTTACGACGATTACTTTTAGAGTACGTACCTCCG 

K FYLGO I SHLVNCVSFDFVVNAANENLMHGG 
Replicase 1a 



GGTGTCGCACGTGCTATTGATATTTTGACTGAAGGTCAACTTCAGTCATTATCTAAAGATTACATTAGTAGTAATGGTCCACTTAAGGTTGG

4232

CCACAGCGTGCACGATAACTATAAAACTGACTTCCAGTTGAAGTCAGTAATAGATTTCTAATGTAATCATCATTACCAGGTGAATTCCAACC 

GVARA I D ILTEGOLOSISKOY ISSNGPLK VG 
Replicase 1a



AGCAGGTGTTATGTTGGAGTGTGAAAAATTCAATGTATTTAATGTTGTTGGTCCGCGAACTGGTAAACATGAGCATTCATTACTTGTTGAAG 

4324

TCGTCCACAATACAACCTCACACTTTTTAAGTTACATAAATTACAACAACCAGGCGCTTGACCATTTGTACTCGTAAGTAATGAACAACTTC 

AGVMLECEKFNVFNVVGPRTGKHEHSLLVE 
Replicase 1a



CTTATAATTCTATTTTATTTGAAAATGGTATTCCACTTATGCCTCTTCTTAGTTGTGGTATTTTTGGTGTAAGGATTGAAAATTCTCTTAAA 

4416

GAATATTAAGATAAAATAAACTTTTACCATAAGGTGAATACGGAGAAGAATCAACACCATAAAAACCACATTCCTAACTTTTAAGAGAATTT 

AYNS 1 LFENG I PLMPLISCG IFGVR IENSLK 
Replicase 1a 



GCTTTGTTTAGTTGTGACATTAATAAACCATTGCAAGTTTTTGTTTATTCTTCAAATGAAGAACAAGCTGTTCTTAAGTTTTTAGATGGTTT
4508

CGAAACAAATCAACACTGTAATTATTTGGTAACGTTCAAAAACAAATAAGAAGTTTACTTCTTGTTCGACAAGAATTCAAAAATCTACCAAA 

ALFSCO I NKPLQVFVYSSNEEQAV. LKFLDGl 
Replicase 1a



AGATTTAACACCAGTCATTGACGATGTTGATGTTGTTAAACCTTTTAGAGTTGAAGGTAATTTTTCATTCTTTGATTGTGGTGTCAATGCCT

4600

TCTAAATTGTGGTCAGTAACTGCTACAACTACAACAATTTGGAAAATCTCAACTTCCATTAAAAAGTAAGAAACTAACACCACAGTTACGGA 

OLTPV I DOVDVVKPFRVEGNFSFFOCGVNA 
Replicase 1a



TGGATGGTGATATTTACTTATTATTTACTAACTCTATTTTAATGTTGGATAAACAAGGACAATTATTGGACACAAAACTTAATGGTATTTTG 

4692

ACCTACCACTATAAATGAATAATAAATGATTGAGATAAAATTACAACCTATTTGTTCCTGTTAATAACCTGTGTTTTGAATTACCATAAAAC 

LOGDI YLIFTNSILMLOKGGQLLDTKLNGIL 
Replicase 1a



CAACAGGCAGTTCTTGATTATCTTGCTACAGTTAAAACTGTACCAGCTGGTAATTTGGTTAAACTTGTTGTTGAGAGTTGTACCATTTATAT 

4784

GTTGTCCGTCAAGAACTAATAGAACGATGTCAATTTTGACATGGTCGACCATTAAACCAATTTGAACAACAACTCTCAACATGGTAAATATA 

QQAV L 0 Y LAT. VKTVPAGNt'vX LVVE SC T IYM 
Replicase 1a



GTGTGTTGTACCATCGATAAATGATCTTTCTTTTGATAAAAATCTTGGTCGTTGTGTGCGTAAACTTAATAGATTGAAAACTTGTGTTATTG 

4876

CACACAACATGGTAGCTATTTACTAGAAAGAAAACTATTTTTAGAACCAGCAACACACGCATTTGAATTATCTAACTTTTGAACACAATAAC

CVVPS I N 0 L SFDK NLGRCVRK L NR-LK TCV I 
Replicase 1a



CCAATGTTCCTGCTATTGATGTTTTGAAAAAGCTTCTTTCAAGTTTGACTTTAACTGTTAAATTTGTTGTAGAGAGTAATGTTATGGATGTT 

4968

GGTTACAAGGACGATAACTACAAAACTTTTTCGAAGAAAGTTCAAACTGAAATTGACAATTTAAACAACATCTCTCATTACAATACCTACAA 

ANVPA IOVLKKILSSLT LTVKFVVESNVMOV 
Replicase 1a






AACGACTGTTTTAAGAATGATAATGTAGTTTTGAAAATTACTGAAGATGGTATTAATGTTAAAGATGTTGTTGTTGAGTCTTCTAAGTCACT 

5060

TTGCTGACAAAATTCTTACTATTACATCAAAACTTTTAATGACTTCTACCATAATTACAATTTCTACAACAACAACTCAGAAGATTCAGTGA 

NDCFKNONVVLK I TEQG ! NVKDVVVESSK Sl 
Replicase 1a



TGGTAAACAATTGGGTGTTGTGAGTGATGGTGTTGACTCTTTTGAAGGTGTTTTACCTATTAATACTGATACTGTCTTATCTGTAGCTCCAG

5152

ACCATTTGTTAACCCACAACACTCACTACCACAACTGAGAAAACTTCCACAAAATGGATAATTATGACTATGACAGAATAGACATCGAGGTC 

GKQLGVVSOGVOSFEGVLP INTOTVLSVAP 
Replicase 1a



AAGTTGACTGGGTTGCTTTTTACGGTTTTGAAAAGGCAGCACTTTTTGCTTCTTTGGATGTAAAGCCATATGGTTACCCTAATGATTTTGTT 

5244

TTCAACTGACCCAACGAAAAATGCCAAAACTTTTCCGTCGTGAAAAACGAAGAAACCTACATTTCGGTATACCAATGGGATTACTAAAACAA 

EVOWVAFYGFEKAALFASLDVKPYGYPNDFV 
Replicase 1a



GGTGGTTTTAGAGTTCTTGGGACCACCGACAATAATTGTTGGGTTAATGCAACTTGTATAATTTTACAGTATCTTAAGCCTACTTTTAAATC 

5336

CCACCAAAATCTCAAGAACCCTGGTGGCTGTTATTAACAACCCAATTACGTTGAACATATTAAAATGTCATAGAATTCGGATGAAAATTTAG 

GGFRVLGTTDNNCWVNATCI ILQYLKPTFKS 
Replicase 1a



TAAGGGTTTAAATGTTCTTTGGAACAAATTTGTTACAGGTGATGTTGGACCTTTTGTTAGTTTTATTTATTTTATAACTATGTCTTCAAAGG 

5428

ATTCCCAAATTTACAAGAAACCTTGTTTAAACAATGTCCACTACAACCTGGAAAACAATCAAAATAAATAAAATATTGATACAGAAGTTTCC 

KGLNVLWNKFVTGDVGPFVSF I YF I TMSSK 
Replicase 1a



GTCAAAAGGGTGATGCTGAAGAGGCATTATCTAAATTGTCAGAGTATTTGATTAGTGATTCTATTGTTACTCTTGAACAATATTCAACTTGT 

5520

CAGTTTTCCCACTACGACTTCTCCGTAATAGATTTAACAGTCTCATAAACTAATCACTAAGATAACAATGAGAACTTGTTATAAGTTGAACA 

GOKGDAEEALSKLSEYL 1 SOSIVTLEQYSTC 
Replicase 1a 



GACATTTGTAAAAGTACTGTAGTTGAAGTTAAAAGTGCTGTTGTCTGTGCTAGTGTGCTTAAAGATGGTTGTGATGTTGGTTTTTGTCCACA

5612

CTGTAAACATTTTCATGACATCAACTTCAATTTTCACGACAACAGACACGATCACACGAATTTCTACCAACACTACAACCAAAAACAGGTGT 

OICKSTVVEVKSAVVCASVLKOGCDVGFCPH 
Replicase 1a 



CAGACATAAATTGCGTTCACGTGTTAAGTTTGTTAATGGACGTGTTGTTATTACCAATGTTGGTGAACCTATAATTTCACAACCTTCTAAGT 

5704

GTCTGTATTTAACGCAAGTGCACAATTCAAACAATTACCTGCACAACAATAATGGTTACAACCACTTGGATATTAAAGTGTTGGAAGATTCA 

RHKLRSRVKFVNGRVVI TNVGEP I ISQPSK 

Replicase 1a 



TGCTTAATGGTATTGCTTATACAACATTTTCAGGTTCTTTTGATAACGGTCACTATGTAGTTTATGATGCTGCTAATAATGCTGTCTATGAT 

5796

ACGAATTACCATAACGAATATGTTGTAAAAGTCCAAGAAAACTATTGCCAGTGATACATCAAATACTACGACGATTATTACGACAGATACTA 

LLNG I AYTTFSGSFONGHYVVYDAANNAV Y 0 
Replicase 1a






GGTGCTCGTTTATTTGCTTCAGATTTGTCTACTTTAGCTGTTACAGCTATTGTTGTAGTAGGTGGTTGTCTAACATCTAATGTTCCACCAAT 

5888

CCACGAGCAAATAAACGAAGTCTAAACAGATGAAATCGACAATGTCGATAACAACATCATCCACCAACACATTGTAGATTACAAGGTGGTTA 

GARLF ASDLS TLAVT A I VVVGGC VTSNV P" P I 
Replicase 1a



TGTTAGTGAGAAAATTTCTGTTATGGATAAACTTGATACTGGTGCACAAAAATTTTTCCAATTTGGTGATTTTGTTATGAATAACATTGTTC 

5980

ACAATCACTCTTTTAAAGACAATACCTATTTGAACTATGACCACGTGTTTTTAAAAAGGTTAAACCACTAAAACAATACTTATTGTAACAAG 

VSEK I SVMOKLOTGAQKFFOFGOFV MNN I V 
Replicase 1a



TGTTTTTAACTTGGTTGCTTAGTATGTTTAGTCTTTTACGTACTTCTATTATGAAGCATGATATTAAAGTTATTGCCAAGGCTCCTAAACGT 

6072

ACAAAAATTGAACCAACGAATCATACAAATCAGAAAATGCATGAAGATAATACTTCGTACTATAATTTCAATAACGGTTCCGAGGATTTGCA 

LFL TWLLSMFSLLRTS1 MKHO I KV I AKAPKR 
Replicase 1a



ACAGGTGTTATTTTGACACGTAGTTTTAAGTATAACATTAGATCTGCTTTGTTTGTTGTAAAGCAGAAGTGGTGTGTTATTGTTACTTTGTT

6164

TGTCCACAATAAAACTGTGCATCAAAATTCATATTGTAATCTAGACGAAACAAACAACATTTCGTCTTCACCACACAATAACAATGAAACAA 

TGV I L TRSFKYNI RSALFVVKO K. WCVI VTLF 
Replicase 1a



TAAGTTCTTATTGTTATTATATGCTATTTATGCACTTGTTTTTATGATTGTGCAATTTAGTCCTTTTAATAGTCTTTTATGTGGTGACATTG 

6256

ATTCAAGAATAACAATAATATACGATAAATACGTGAACAAAAATACTAACACGTTAAATCAGGAAAATTATCAGAAAATACACCACTGTAAC

KFLLLLYA I YALVFMIVQFS PF. NSLUCGO I 
Replicase 1a



TAAGTGGTTATGAAAAATCCACTTTTAATAAGGATATTTATTGTGGTAATTCTATGGTTTGTAAGATGTGTTTGTTTAGTTATCAAGAGTTT 

6348

ATTCACCAATACTTTTTAGGTGAAAATTATTCCTATAAATAACACCATTAAGATACCAAACATTCTACACAAACAAATCAATAGTTCTCAAA 

VSGYEK'ST F N K 0 I YCGNSMVCKMCLFSYQEF 
Replicase 1a



AATGATTTGGATCATACTAGTCTTGTTTGGAAGCACATTCGTGATCCTATATTAATCAGTTTACAACCATTTGTTATACTTGTTATTTTGTT 

6440

TTACTAAACCTAGTATGATCAGAACAAACCTTCGTGTAAGCACTAGGATATAATTAGTCAAATGTTGGTAAACAATATGAACAATAAAACAA

NO LOHTSLVWKHIROPILISLQPFVItV ILL 
Replicase 1a 



AATTTTTGGTAATATGTATTTGCGTTTTGGACTTTTATATTTTGTTGCACAATTTATTAGTACTTTTGGTTCTTTCTTAGGCTTTCATCAGA 

6532

TTAAAAACCATTATACATAAACGCAAAACCTGAAAATATAAAACAACGTGTTAAATAATCATGAAAACCAAGAAAGAATCCGAAAGTAGTCT

1 FGNMYLRFGLLY'FVAOF I STFGSFLGFHQ 
Replicase 1a



AACAGTGGTTTTTACATTTTGTGCCGTTTGATGTTTTATGTAATGAGTTTTTAGCTACATTTATTGTCTGCAAAATTGTTTTATTTGTTAGA 

6624

TTGTCACCAAAAATGTAAAACACGGCAAACTACAAAATACATTACTCAAAAATCGATGTAAATAACAGACGTTTTAACAAAATAAACAATCT 

KQWF L HF V PF OVLC NE F LA T F | V C K IVLFVR 
Replicase 1a






CATATTATTGTTGGCTGTAATAATGCTGACTGTGTAGCTTGTTCTAAAAGTGCTAGACTTAAACGTGTACCACTTCAAACTATTATTAATGG

6716

GTATAATAACAACCGACATTATTACGACTGACACATCGAACAAGATTTTCACGATCTGAATTTGCACATGGTGAAGTTTGATAATAATTACC

HIIVGCNNADCVACSKSARLKRVPLQTIING
Replicase 1a



TATGCATAAATCATTCTATGTTAATGCTAATGGTGGTACTTGTTTCTGTAATAAACATAACTTCTTTTGTGTTAATTGTGATTCTTTTGGGC 

6808

ATACGTATTTAGTAAGATACAATTACGATTACCACCATGAACAAAGACATTATTTGTATTGAAGAAAACACAATTAACACTAAGAAAACCCG 



MHKSFYVNANGGTCFCNKHNFFCVNCDSFG 
Replicase 1a



CTGGTAATACTTTTATTAATGGTGATATTGCAAGAGAGCTTGGTAATGTTGTTAAAACAGCTGTTCAACCCACAGCTCCTGCATATGTTATT

6900

GACCATTATGAAAATAATTACCACTATAACGTTCTCTCGAACCATTACAACAATTTTGTCGACAAGTTGGGTGTCGAGGACGTATACAATAA 

PGNTF I NGO 1 ARELGNVVKTAVOPTAPAYVl 
Replicase 1a 



ATTGATAAGGTAGATTTTGTTAATGGATTTTATCGTCTTTATAGTGGTGACACTTTTTGGCGGTATGACTTTGACATTACTGAATCTAAGTA

6992

TAACTATTCCATCTAAAACAATTACCTAAAATAGCAGAAATATCACCACTGTGAAAAACCGCCATACTGAAACTGTAATGACTTAGATTCAT 

I OKVO FVNGF YRLYSGOTFWRYOF D I TESKY 
Replicase 1a



TAGTTGTAAAGAGGTTCTGAAGAATTGTAATGTTTTAGAAAATTTTATTGTTTACAATAATAGTGGTAGTAACATTACACAGATTAAAAATG 

7084

ATCAACATTTCTCCAAGACTTCTTAACATTACAAAATCTTTTAAAATAACAAATGTTATTATCACCATCATTGTAATGTGTCTAATTTTTAC 

SCKEVLKNCN.VLENF I VYNNSGSN I TQ I KN 
Replicase 1a



CTTGTGTTTATTTTTCTCAATTGTTGTGTGAACCTATAAAGTTGGTAAATTCAGAGTTGTTGTCAACTTTATCAGTTGATTTTAATGGTGTT 

7176

GAACACAAATAAAAAGAGTTAACAACACACTTGGATATTTCAACCATTTAAGTCTCAACAACAGTTGAAATAGTCAACTAAAATTACCACAA 

ACVY F SOLL CEP 1KIVNSELLSTLSVDFNGV 
Replicase 1a



TTGCATAAGGCATATGTTGATGTTTTGTGTAATAGTTTTTTTAAGGAGCTAACTGCTAACATGTCCATGGCTGAATGTAAAGCTACACTTGG 

7268

AACGTATTCCGTATACAACTACAAAACACATTATCAAAAAAATTCCTCGATTGACGATTGTACAGGTACCGACTTACATTTCGATGTGAACC 

LHKAYVOVLCNSFFKELTANMSMAECKATLG 
Replicase 1a



TTTGACTGTTTCTGATGATGATTTTGTTTCAGCTGTTGCCAATGCACATAGGTATGACGTTTTGCTTTCAGATTTGTCATTTAATAATTTTT

7360

AAACTGACAAAGACTACTACTAAAACAAAGTCGACAACGGTTACGTGTATCCATACTGCAAAACGAAAGTCTAAACAGTAAATTATTAAAAA

LTVSDDDFVSAVANAHRYDVLLSDLSFNNF

Replicase 1a



TTATTTCTTATGCTAAACCTGAAGATAAGTTGTCCGTTTATGACATTGCTTGTTGTATGCGTGCCGGTTCTAAGGTTGTTAACCATAATGTT 

7452

AATAAAGAATACGATTTGGACTTCTATTCAACAGGCAAATACTGTAACGAACAACATACGCACGGCCAAGATTCCAACAATTGGTATTACAA 

FISYAKPE.OKLSVYOIACCMRAG. SKVVN H N V 
Replicase 1a






TTAATCAAAGAGTCAATACCTATTGTTTGGGGTGTCAAGGACTTTAATACTCTTTCTCAAGAAGGTAAGAAGTACCTTGTTAAAACAACTAA 

7544

AATTAGTTTCTCAGTTATGGATAACAAACCCCACAGTTCCTGAAATTATGAGAAAGAGTTCTTCCATTCTTCATGGAACAATTTTGTTGATT 

L I KES 1 P I VWGVK OF N T I SO E G K K Y L V K T'T K 
Replicase 1a



AGCAAAGGGTTTGACTTTTTTATTAACTTTTAATGATAACCAAGCAATTACACAAGTTCCTGCTACTAGTATAGTTGCAAAACAGGGTGCTG 

7636

TCGTTTCCCAAACTGAAAAAATAATTGAAAATTACTATTGGTTCGTTAATGTGTTCAAGGACGATGATCATATCAACGTTTTGTCCCACGAC 

AKGL TFLLTFN ONQA I TGVPATS I VAKQG A 
Replicase 1a



GTTTTAAACGTACTTATAATTTTCTGTGGTATGTATGTTTATTTGTTGTTGCATTGTTTATTGGTGTCTCATTTATTGATTATACAACCACT 

7728

CAAAATTTGCATGAATATTAAAAGACACCATACATACAAATAAACAACAACGTAACAAATAACCACAGAGTAAATAACTAATATGTTGGTGA 

GFKR TYN'FLWYVCLFVVALF I GVSF I DYTTT 
Replicase 1a



GTAACTAGCTTTCATGGTTATGATTTTAAGTACATTGAGAATGGTCAGTTGAAGGTGTTTGAAGCACCTTTACACTGTGTTCGTAATGTTTT 

7820

CATTGATCGAAAGTACCAATACTAAAATTCATGTAACTCTTACCAGTCAACTTCCACAAACTTCGTGGAAATGTGACACAAGCATTACAAAA

VTSFHGYDFKYIENGOLKVFE A. PLHC VRNVF 
Replicase 1a



TGATAATTTTAATCAATGGCATGAGGCTAAGTTTGGTGTTGTTACTACTAATAGTGATAAATGTCCTATAGTTGTTGGTGTTTCAGAGCGTA

7912

ACTATTAAAATTAGTTACCGTACTCCGATTCAAACCACAACAATGATGATTATCACTATTTACAGGATATCAACAACCACAAAGTCTCGCAT 

DNF N.QWHEAKFGVVT T NSOK C P I VVG V SER 
Replicase 1a



TTAATGTTGTTCCTGGTGTTCCAACAAATGTATATTTGGTAGGAAAGACTCTTGTTTTTACATTACAGGCTGCTTTTGGAAACACAGGTGTT

8004

AATTACAACAAGGACCACAAGGTTGTTTACATATAAACCATCCTTTCTGAGAACAAAAATGTAATGTCCGACGAAAACCTTTGTGTCCACAA 

INVVPGVPtNVYLVGKTLVFTLOAAFG NTGV 
Replicase 1a



TGTTATGACTTTGATGGTGTTACCACTAGTGATAAGTGTATTTTTAATTCTGCTTGTACTAGGTTGGAAGGTTTGGGTGGTGACAATGTTTA 

8096

ACAATACTGAAACTACCACAATGGTGATCACTATTCACATAAAAATTAAGACGAACATGATCCAACCTTCCAAACCCACCACTGTTACAAAT 

CYOF DGV T TSDKC IFNSACTRLEGLGGONVY 
Replicase 1a



TTGTTACAACACTGATCTTATTGAAGGTTCTAAACCTTATAGTATTTTACAGCCCAATGCTTATTATAAGTATGATGTTAAAAATTATGTAC 

8188

AACAATGTTGTGACTAGAATAACTTCCAAGATTTGGAATATCATAAAATGTCGGGTTACGAATAATATTCATACTACAATTTTTAATACATG 

CYNTOL I EGSKPYSILOPNAYYKYDVKNYV 
Replicase 1a



GTTTTCCAGAAATTTTAGCTAGAGGTTTTGGCTTACGTACTATTAGAACTTTGGCTACACGTTATTGTAGAGTTGGTGAATGCCGTGACTCA 

8280

CAAAAGGTCTTTAAAATCGATCTCCAAAACCGAATGCATGATAATCTTGAAACCGATGTGCAATAACATCTCAACCACTTACGGCACTGAGT

RFPEILARGFGLRTIRTLATRYCRVGECRDS 

Replicase 1a






CATAAAGGTGTTTGTTTTGGTTTTGATAAATGGTATGTTAATGATGGACGTGTTGATGACGGTTACATTTGTGGTGATGGTCTTATAGACCT 

8372

GTATTTCCACAAACAAAACCAAAACTATTTACCATACAATTACTACCTGCACAACTACTGCCAATGTAAACACCACTACCAGAATATCTGGA 

HKGVCFGFOKWYVNDGRVDOGY ! CGOGL IOL 
Replicase 1a



TCTTGTTAATGTACTCTCAATCTTTAGTTCATCTTTTAGCGTTGTGGCTATGTCTGGACATATGTTGTTTAATTTTCTTTTTGCAGCATTTA 

8464

AGAACAATTACATGAGAGTTAGAAATCAAGTAGAAAATCGCAACACCGATACAGACCTGTATACAACAAATTAAAAGAAAAACGTCGTAAAT

LVNVLS IFSSSFSVVAMSGHMlFNFLFAAF 

Replicase 1a 



TTACATTTTTGTGCTTTTTAGTTACTAAATTTAAACGTGTTTTTGGTGATCTTTCTTATGGTGTTTTTACTGTTGTTTGTGCAACTTTGATT 

8556

AATGTAAAAACACGAAAAATCAATGATTTAAATTTGCACAAAAACCACTAGAAAGAATACCACAAAAATGACAACAAACACGTTGAAACTAA 

ITFLCFLVTKFKRVFGDLSYGVFTVVCATLI 
Replicase 1a



AATAACATTTCTTATGTTGTTACTCAAAATTTATTTTTTATGTTGCTTTATGCTATTTTGTATTTTGTTTTTACTAGGACAGTGCGTTATGC 

8648

TTATTGTAAAGAATACAACAATGAGTTTTAAATAAAAAATACAACGAAATACGATAAAACATAAAACAAAAATGATCCTGTCACGCAATACG 

NN I SYVVTONLFFMLLYA I LYFVFT RTVRY A 
Replicase 1a



TTGGATTTGGCATATTGCATACATTGTTGCATACTTCTTGTTAATACCATGGTGGCTTCTCACATGGTTTAGTTTTGCTGCATTTTTAGAGC 

8740

AACCTAAACCGTATAACGTATGTAACAACGTATGAAGAACAATTATGGTACCACCGAAGAGTGTACCAAATCAAAACGACGTAAAAATCTCG 

WIWH I A Y I VAYFLL I PWWLLTWFSFAAF LE 
Replicase 1a



TTTTACCTAATGTTTTTAAGTTAAAAATCTCTACTCAATTGTTTGAAGGTGATAAGTTTATAGGTACTTTTGAGAGTGCTGCTGCAGGTACA 

8832

AAAATGGATTACAAAAATTCAATTTTTAGAGATGAGTTAACAAACTTCCACTATTCAAATATCCATGAAAACTCTCACGACGACGTCCATGT 

LLPNVFKLK I STOLFE GDKF I GTFESAAAGT 
Replicase 1a



TTTGTTCTTGACATGCGTTCTTATGAAAGGCTGATAAATACTATTTCACCTGAGAAACTTAAGAATTATGCTGCAAGTTATAATAAATATAA 

8924

AAACAAGAACTGTACGCAAGAATACTTTCCGACTATTTATGATAAAGTGGACTCTTTGAATTCTTAATACGACGTTCAATATTATTTATATT 

FVLDMRSYE R L INTI SPEKLKNYAASYNKYK 
Replicase 1a



ATATTATAGTGGTAGTGCTAGTGAGGCTGATTATCGTTGTGCTTGTTATGCTCATTTAGCCAAGGCTATGTTAGATTACGCAAAAGATCATA 

9016

TATAATATCACCATCACGATCACTCCGACTAATAGCAACACGAACAATACGAGTAAATCGGTTCCGATACAATCTAATGCGTTTTCTAGTAT 

YYSGSASEAOYRCACYAH- LAKAMLOYAKOH 
Replicase 1a



ATGACATGTTATATTCTCCACCTACCATTAGCTACAATTCCACCTTACAATCTGGTCTTAAGAAGATGGCACAACCATCTGGTTGTGTTGAG 

9108

TACTGTACAATATAAGAGGTGGATGGTAATCGATGTTAAGGTGGAATGTTAGACCAGAATTCTTCTACCGTGTTGGTAGACCAACACAACTC 

NDMLYSPPT ISYNSTLOSGLKKMAOPSGCVE 
Replicase 1a






AGATGTGTGGTTCGCGTCTGTTATGGTAGTACTGTGCTTAATGGAGTTTGGTTAGGTGACACTGTTACTTGTCCTAGACATGTCATAGCACC 

9200

TCTACACACCAAGCGCAGACAATACCATCATGACACGAATTACCTCAAACCAATCCACTGTGACAATGAACAGGATCTGTACAGTATCGTGG 

RCVVRVC YGSTVLNGVWLGDTVTCPRHV I AP 
Replicase 1a



ATCAACCACTGTTCTTATTGATTATGATCATGCATATAGTACTATGCGTTTGCATAATTTTTCAGTGTCTCATAATGGTGTCTTCTTGGGAG 

9292

TAGTTGGTGACAAGAATAACTAATACTAGTACGTATATCATGATACGCAAACGTATTAAAAAGTCACAGAGTATTACCACAGAAGAACCCTC 

STTVL I OYDHAYSTMRLHNFSVSHNGVFLG 
Replicase 1a 



TTGTTGGTGTTACAATGCATGGTTCTGTGTTGCGTATTAAGGTTTCACAATCTAATGTACATACACCTAAACATGTTTTTAAAACGTTGAAA 

9384

AACAACCACAATGTTACGTACCAAGACACAACGCATAATTCCAAAGTGTTAGATTACATGTATGTGGATTTGTACAAAAATTTTGCAACTTT 

VVGVTNHGSVlRl KVSQSNVHTPKHVFKTLK 
Replicase 1a



CCTGGTGCTTCTTTTAATATTTTAGCATGTTATGAAGGTATTGCATCTGGTGTTTTTGGTGTTAATTTACGTACAAACTTTACTATTAAAGG 

9476

GGACCACGAAGAAAATTATAAAATCGTACAATACTTCCATAACGTAGACCACAAAAACCACAATTAAATGCATGTTTGAAATGATAATTTCC 

PGASFN I L A C Y £ G I ASGVFGVNLRTNFT 1 KG 
Replicase 1a



TTCTTTTATAAATGGAGCTTGTGGTTCTCCTGGTTATAATGTTAGAAATGATGGTACTGTTGAGTTTTGTTATTTACACCAAATTGAGTTAG 

9568

AAGAAAATATTTACCTCGAACACCAAGAGGACCAATATTACAATCTTTACTACCATGACAACTCAAAACAATAAATGTGGTTTAACTCAATC 

SF I NGAC G5PGYNVRN0GT VEF CYLHO 1 EL 
Replicase 1a



GTAGTGGTGCTCATGTTGGTTCTGATTTTACTGGTAGTGTTTATGGTAATTTTGATGACCAACCTAGTTTGCAAGTTGAGAGTGCCAACCTT 

9660

CATCACCACGAGTACAACCAAGACTAAAATGACCATCACAAATACCATTAAAACTACTGGTTGGATCAAACGTTCAACTCTCACGGTTGGAA 

GSGAHVGSDFTG5VYGNF00QPSLGVESA NL 
Replicase 1a



ATGCTATCAGATAATGTTGTTGCCTTTTTGTATGCTGCTTTGTTGAATGGTTGTAGGTGGTGGTTGCGTTCAACTAGAGTTAATGTTGATGG 

9752

TACGATAGTCTATTACAACAACGGAAAAACATACGACGAAACAACTTACCAACATCCACCACCAACGCAAGTTGATCTCAATTACAACTACC 

MLSDNVVAFLYAALLNGCRWWLRSTRVNVOG 
— * Replicase 1a 



TTTTAATGAATGGGCTATGGCTAATGGTTATACAATTGTTTCTAGTGTTGAGTGCTATTCTATTTTGGCAGCAAAAACTGGTGTTAGTGTTG 

9844

AAAATTACTTACCCGATACCGATTACCAATATGTTAACAAAGATCACAACTCACGATAAGATAAAACCGTCGTTTTTGACCACAATCACAAC 

FNEWAMANGYT I VS. SVECYS 1LAAKTGVSV 
; Replicase 1a 



AACAATTGTTAGCTTCCATTCAACATCTTCATGAAGGTTTTGGTGGTAAAAACATACTTGGTTATTCTAGTTTATGTGATGAGTTCACACTA 

9936

TTGTTAACAATCGAAGGTAAGTTGTAGAAGTACTTCCAAAACCACCATTTTTGTATGAACCAATAAGATCAAATACACTACTCAAGTGTGAT 

EQLLAS I OHLHEGFGGK NI LGYSSLCOEFTL 
Replicase 1a • •■ 






GCTGAAGTTGTGAAGCAGATGTATGGTGTTAACTTGCAAAGTGGTAAGGTTATTTTTGGTTTAAAAACAATGTTTTTATTTAGCGTTTTCTT

10028

CGACTTCAACACTTCGTCTACATACCACAATTGAACGTTTCACCATTCCAATAAAAACCAAATTTTTGTTACAAAAATAAATCGCAAAAGAA

AEVVK QMYGVNLOSGKV I FGLKTHFLFSVFF 
— — ^ — Replicase 1a 



CACAATGTTTTGGGCAGAACTCTTTATTTATACAAACACTATATGGATAAACCCTGTTATACTTACACCTATATTTTGTTTACTTTTGTTTT

10120

GTGTTACAAAACCCGTCTTGAGAAATAAATATGTTTGTGATATACCTATTTGGGACAATATGAATGTGGATATAAAACAAATGAAAACAAAA 



TflFWAELF I YTNTIWI N.PV ILTP 1 FCLLLF 
— Replicase 1a 



TGTCATTAGTTTTAACTATGTTTCTTAAACATAAGTTTTTGTTTTTGCAAGTATTTTTATTACCTACTGTTATTGCAACTGCTTTATATAAT 

10212

ACAGTAATCAAAATTGATACAAAGAATTTGTATTCAAAAACAAAAACGTTCATAAAAATAATGGATGACAATAACGTTGACGAAATATATTA

LSLVLTMFLKHKFLFLQVFLLPTVIATALY-N 
: Replicase 1a : : = 



TGTGTTTTGGATTATTACATAGTAAAATTTTTGGCTGACCATTTTAACTATAATGTTTCAGTATTACAAATGGATGTTCAGGGTTTAGTTAA 

10304

ACACAAAACCTAATAATGTATCATTTTAAAAACCGACTGGTAAAATTGATATTACAAAGTCATAATGTTTACCTACAAGTCCCAAATCAATT 

CVLDYY I VKFLADHFNYNVSVLQMDVOGLVN 
— Replicase 1a 



TGTTTTGGTCTGTTTATTTGTTGTATTTTTACACACATGGCGTTTTTCTAAAGAACGTTTCACACATTGGTTTACATATGTGTGTTCTCTTA 

10396

ACAAAACCAGACAAATAAACAACATAAAAATGTGTGTACCGCAAAAAGATTTCTTGCAAAGTGTGTAACCAAATGTATACACACAAGAGAAT 

VLVCLFVVFLHTWRFSKERFTH WFTYVCSL 
: Replicase 1a ? : 



TAGCAGTTGCTTACACTTATTTTTATAGTGGTGACTTTTTGAGTTTGCTTGTTATGTTTTTATGTGCTATATCTAGTGATTGGTACATTGGT 

10488

ATCGTCAACGAATGTGAATAAAAATATCACCACTGAAAAACTCAAACGAACAATACAAAAATACACGATATAGATCACTAACCATGTAACCA 

1 AVAY TYFYSGOFLSLLVMFLCA I SSOWY 1G 
■' Replicase 1a— — — 



GCCATTGTTTTTAGGTTGTCACGTTTGATTATATTTTTTTCACCTGAAAGTGTATTTAGTGTTTTTGGTGATGTGAAACTCACTTTAGTTGT 

10580

CGGTAACAAAAATCCAACAGTGCAAACTAATATAAAAAAAGTGGACTTTCACATAAATCACAAAAACCACTACACTTTGAGTGAAATCAACA 

AI VFRLSRL I I FF SPESVFSVFGDVKLTLVV 
■ Replicase 1a ; 



TTATTTAATTTGTGGTTATTTAGTTTGTACTTATTGGGGCATTTTGTATTGGTTCAATAGGTTTTTTAAATGTACTATGGGTGTTTATGATT

10672

AATAAATTAAACACCAATAAATCAAACATGAATAACCCCGTAAAACATAACCAAGTTATCCAAAAAATTTACATGATACCCACAAATACTAA 

YL I CGYLVCTYW GILYWFNRFFKCTMGVYO 
Replicase 1a— 



TTAAGGTGAGTGCTGCTGAATTTAAATACATGGTTGCTAATGGACTTCATGCACCATATGGACCTTTTGATGCACTTTGGTTATCATTCAAA 

10764

AATTCCACTCACGACGACTTAAATTTATGTACCAACGATTACCTGAAGTACGTGGTATACCTGGAAAACTACGTGAAACCAATAGTAAGTTT 

FKVSAAEFKYMVANGLHAPYGPFDALWLS F K 
— Replicase 1a- ; 






TTACTTGGTATTGGTGGTGACCGTTGTATAAAAATTTCAACTGTCCAATCCAAACTGACTGATTTGAAGTGTACTAATGTTGTGTTATTGGG

10856

AATGAACCATAACCACCACTGGCAACATATTTTTAAAGTTGACAGGTTAGGTTTGACTGACTAAACTTCACATGATTACAACACAATAACCC 

ULG I GGORC IK ISTVOSKLTDLKCTNVVLLG 
Replicase 1a



TTGTTTGTCTAGTATGAACATTGCAGCTAATTCTAGTGAATGGGCTTATTGTGTTGATTTACACAATAAGATTAATCTTTGTGATGACCCAG

10948

AACAAACAGATCATACTTGTAACGTCGATTAAGATCACTTACCCGAATAACACAACTAAATGTGTTATTCTAATTAGAAACACTACTGGGTC 

CLSSMN I AANSSEWAYCVDLHNK INLCOOP 
■ Replicase 1a 



AAAAAGCTCAAGGTATGTTGTTAGCACTCCTTGCGTTCTTTCTAAGTAAACATAGTGATTTTGGTCTTGATGGCCTTATTGATTCTTATTTT 

11040

TTTTTCGAGTTCCATACAACAATCGTGAGGAACGCAAGAAAGATTCATTTGTATCACTAAAACCAGAACTACCGGAATAACTAAGAATAAAA 

EKAOGMtlAlLAFFL SKHS DFGLDGL ! OSTF 
Replicase 1a



GATAATAGTAGCACCCTGCAGAGTGTTGCTTCATCATTTGTTAGTATGCCATCATATATTGCTTATGAAAATGCTAGACAAGCTTATGAGGA 

11132

CTATTATCATCGTGGGACGTCTCACAACGAAGTAGTAAACAATCATACGGTAGTATATAACGAATACTTTTACGATCTGTTCGAATACTCCT 

DNSSTLQSVASSFVSMPSY IAYE NARGAYED 
. — Replicase 1a : 



TGCTATTGCTAATGGATCTTCTTCTCAACTTATTAAACAATTGAAGCGTGCCATGAATATCGCAAAGTCTGAATTTGATCATGAGATATCTG

11224

ACGATAACGATTACCTAGAAGAAGAGTTGAATAATTTGTTAACTTCGCACGGTACTTATAGCGTTTCAGACTTAAACTAGTACTCTATAGAC 

Al ANGSSSQL IKQ LKRAMNIAKSEFDHE I S 
— ■ Replicase 1a 



TTCAGAAGAAAATTAATAGAATGGCTGAACAAGCTGCTACTCAGATGTATAAAGAAGCACGCTCTGTTAATAGAAAATCTAAAGTTATTAGT 

11316

AAGTCTTCTTTTAATTATCTTACCGACTTGTTCGACGATGAGTCTACATATTTCTTCGTGCGAGACAATTATCTTTTAGATTTCAATAATCA 

VQKK I NRMAEQAATGMYKEARSVNRK SKV I S 
■ — -Replicase 1a — — 



GCTATGCACTCTTTACTTTTTGGAATGTTAAGACGTTTGGATATGTCTAGTGTTGAAACTGTTTTGAATTTAGCACGTGATGGTGTTGTGCC 

11408

CGATACGTGAGAAATGAAAAACCTTACAATTCTGCAAACCTATACAGATCACAACTTTGACAAAACTTAAATCGTGCACTACCACAACACGG

AMHSLLFGMLRRLDMSSVETVLNLAROGVVP 

— -Replicase 1a 



ATTGTCAGTTATACCTGCAACTTCAGCTTCCAAACTAACTATTGTTAGTCCAGATCTTGAATCTTATTCTAAGATTGTTTGTGATGGTTCTG 

11500

TAACAGTCAATATGGACGTTGAAGTCGAAGGTTTGATTGATAACAATCAGGTCTAGAACTTAGAATAAGATTCTAACAAACACTACCAAGAC 



LSV I PA .TSASKLT IVSPDLESYSK I VCOGS 
Replicase 1a — — 



TTCATTATGCTGGAGTTGTTTGGACACTTAATGATGTTAAAGACAATGATGGTAGACCTGTTCATGTTAAAGAGATTACAAGGGAGAATGTT 

11592

AAGTAATACGACCTCAACAAACCTGTGAATTACTACAATTTCTGTTACTACCATCTGGACAAGTACAATTTCTCTAATGTTCCCTCTTACAA 



V-HYAGVVWTLNDVKONOGRPVHV'KE i TRE N V 

Replicase 1a^ — . 






GAAACTTTGACATGGCCTCTTATCCTTAATTGTGAACGTGTTGTTAAACTTCAAAATAATGAAATTATGCCTGGTAAACTTAAGCAAAAACC 

11684

CTTTGAAACTGTACCGGAGAATAGGAATTAACACTTGCACAACAATTTGAAGTTTTATTACTTTAATACGGACCATTTGAATTCGTTTTTGG 

ETtTWPL i LNCERVVK'LQNNEIHPGXLKQKP 
Replicase 1a



TATGAAAGCTGAGGGTGATGGTGGTGTTTTAGGTGATGGTAATGCTTTGTATAATACTGAGGGTGGTAAAACTTTTATGTATGCTTATATTT 

11776

ATACTTTCGACTCCCACTACCACCACAAAATCCACTACCATTACGAAACATATTATGACTCCCACCATTTTGAAAATACATACGAATATAAA 

MK AEGOGGVLGDGNALYNT. EGGKTFMYAY ! 
: Replicase 1a 



CTAATAAAGCTGACCTTAAATTTGTTAAGTGGGAGTATGAGGGTGGTTGCAACACAATCGAGTTAGACTCTCCTTGTCGATTTATGGTCGAA 

11868

GATTATTTCGACTGGAATTTAAACAATTCACCCTCATACTCCCACCAACGTTGTGTTAGCTCAATCTGAGAGGAACAGCTAAATACCAGCTT 

SNKAOLKFV'KWEYEGGCNT 1 ELDSPCRFMVE 
Replicase 1a 



ACACCTAATGGTCCTCAAGTGAAGTATTTGTATTTTGTTAAAAATTTAAATACCTTACGTAGAGGTGCCGTTCTTGGTTTTATAGGTGCCAC 

11960

TGTGGATTACCAGGAGTTCACTTCATAAACATAAAACAATTTTTAAATTTATGGAATGCATCTCCACGGCAAGAACCAAAATATCCACGGTG 

TP NGP QVK YL.YFVKNLNTLRRGAVLGF I GAT 
Replicase 1a- 



AATTCGTCTACAAGCTGGTAAACAAACTGAATTGGCTGTTAATTCTGGACTTTTAACTGCTTGTGCTTTTTCTGTTGATCCAGCAACCACTT 

12052

TTAAGCAGATGTTCGACCATTTGTTTGACTTAACCGACAATTAAGACCTGAAAATTGACGAACACGAAAAAGACAACTAGGTCGTTGGTGAA

1 RL QAGKOTEL AVNSGLLTACAFS VOPATT 
Replicase 1a 



ACTTGGAAGCTGTTAAACATGGTGCAAAACCTGTAAGTAATTGTATTAAGATGTTATCTAATGGTGCTGGTAATGGTCAAGCTATAACAACT 

12144

TGAACCTTCGACAATTTGTACCACGTTTTGGACATTCATTAACATAATTCTACAATAGATTACCACGACCATTACCAGTTCGATATTGTTGA 

YLEAVKHGAKPVSNC I KHLSNGAGNGQA I TT 
— Replicase 1a — 1 



AGTGTAGATGCTAACACCAATCAAGATTCTTATGGTGGAGCGTCTATTTGTTTGTATTGTCGGGCCCACGTTCCTCACCCTAGTATGGATGG 

12236

TCACATCTACGATTGTGGTTAGTTCTAAGAATACCACCTCGCAGATAAACAAACATAACAGCCCGGGTGCAAGGAGTGGGATCATACCTACC

SVOAN TNO DSYGGASI CLYCRAHVPHPSMOG 
— Replicase 1a 



TTACTGTAAGTTTAAGGGTAAATGTGTTCAGGTTCCTATTGGTTGTTTGGATCCTATTAGGTTTTGTTTAGAAAATAATGTGTGTAATGTTT 

12328

AATGACATTCAAATTCCCATTTACACAAGTCCAAGGATAACCAACAAACCTAGGATAATCCAAAACAAATCTTTTATTACACACATTACAAA 

YCKFKGKCVOVPIGCLDPIRFCLENNVCNV 

Replicase 1a 






GTGGTTGTTGGTTGGGACACGGGTGTGCTTGTGATCGTACAACCATTCAAAGTGTTGACATTTCTTATTTAAACGAGCAAGGGGTTCTAGTG

12420

CACCAACAACCAACCCTGTGCCCACACGAACACTAGCATGTTGGTAAGTTTCACAACTGTAAAGAATAAATTTGCTCGTTCCCCAAGATCAC 

CGCW L GHGCACORTT | QSVD I S.Y L N E 0 G V L V 
Replicase 1a

. R A R G S S 
Replicase 1b

CAGCTCGACTAGAACCCTGTAATGGCACGGACATCGATAAGTGTGTTCGTGCTTTTGACATTTATAATAAAAATGTTTCATTCTTGGGTAAG 

12512

GTCGAGCTGATCTTGGGACATTACCGTGCCTGTAGCTATTCACACAAGCACGAAAACTGTAAATATTATTTTTACAAAGTAAGAACCCATTC 

OLD.. 
Replicase 1a

AARLEPCNGTO IOKCVRAFOI YNKNVSFtGK 
Replicase 1b



TGTTTGAAGATGAACTGTGTTCGTTTTAAAAATGCTGATCTTAAGGATGGTTATTTTGTTATAAAGAGGTGTACTAAGTCGGTTATGGAACA 

12604

ACAAACTTCTACTTGACACAAGCAAAATTTTTACGACTAGAATTCCTACCAATAAAACAATATTTCTCCACATGATTCAGCCAATACCTTGT 

CLKMNCVRFKNA0LKDGYFV1 KRCTKSVMEH 
Replicase 1b 



CGAGCAATCCATGTATAACCTACTTAACTTTTCTGGTGCTTTGGCTGAGCATGATTTCTTTACTTGGAAAGATGGCAGAGTCATTTATGGTA 

12696

GCTCGTTAGGTACATATTGGATGAATTGAAAAGACCACGAAACCGACTCGTACTAAAGAAATGAACCTTTCTACCGTCTCAGTAAATACCAT 

EQSMYNLLNFSGALAEHDFFTWKDGRV I YG 
; Replicase 1b 



ATGTTAGTAGACATAATCTTACTAAATATACTATGATGGACTTGGTTTATGCTATGCGTAACTTTGATGAACAAAATTGTGATGTTCTAAAA 

12788

TACAATCATCTGTATTAGAATGATTTATATGATACTACCTGAACCAAATACGATACGCATTGAAACTACTTGTTTTAACACTACAAGATTTT 

NVSRH NL TK Y TMMDLVYAMRNFDEQ NCDVLK 
Replicase 1b 



GAAGTATTAGTTTTAACTGGTTGTTGTGACAATTCTTATTTTGATAGTAAGGGTTGGTATGACCCAGTTGAAAATGAAGATATACATAGAGT 

12880

CTTCATAATCAAAATTGACCAACAACACTGTTAAGAATAAAACTATCATTCCCAACCATACTGGGTCAACTTTTACTTCTATATGTATCTCA 

EVL VL T G C CO NSY FOSK GWY DPVE N E O.I.HR V 
Replicase 1b 



TTATGCATCTCTTGGCAAAATTGTAGCTAGAGCTATGCTTAAATGCGTTGCTCTATGTGATGCGATGGTTGCTAAAGGTGTTGTTGGTGTTT 

12972

AATACGTAGAGAACCGTTTTAACATCGATCTCGATACGAATTTACGCAACGAGATACACTACGCTACCAACGATTTCCACAACAACCACAAA 



YASLGK I VARAHLKCVALCOAMVAKGVVGV 
Replicase 1b : 



TAACATTAGATAACCAAGATCTTAATGGTAACTTTTATGATTTTGGTGATTTTGTTGTTAGCTTACCTAATATGGGTGTTCCCTGTTGTACA 

13064

ATTGTAATCTATTGGTTCTAGAATTACCATTGAAAATACTAAAACCACTAAAACAACAATCGAATGGATTATACCCACAAGGGACAACATGT 

LTL ONOOLNGNFYOFGOFVVSLPNMGVPCCT 

Replicase 1b






TCATATTATTCTTATATGATGCCTATTATGGGTTTAACTAATTGTTTAGCTAGTGAGTGTTTTGTCAAGAGTGATATTTTTGGTAGTGATTT

13156

AGTATAATAAGAATATACTACGGATAATACCCAAATTGATTAACAAATCGATCACTCACAAAACAGTTCTCACTATAAAAACCATCACTAAA 

SY YSYHMP IMGLTNCLASECFVKSD I F G S 0 F 
; Replicase 1b : ' 



TAAAACTTTTGATTTGCTTAAGTATGATTTCACTGAACATAAAGAAAATTTATTCAATAAGTACTTTAAGCATTGGAGTTTTGATTATCATC 

13248

ATTTTGAAAACTAAACGAATTCATACTAAAGTGACTTGTATTTCTTTTAAATAAGTTATTCATGAAATTCGTAACCTCAAAACTAATAGTAG 



KTFDILKYDF TEHKENLFNKYFKHWSFO YH 
Replicase 1b ; 



CTAATTGTAGTGACTGTTATGATGATATGTGTGTTATACATTGTGCTAATTTTAATACACTATTTGCCACAACTATACCAGGTACTGCTTTT 

13340

GATTAACATCACTGACAATACTACTATACACACAATATGTAACACGATTAAAATTATGTGATAAACGGTGTTGATATGGTCCATGACGAAAA 

PNCSOCYOOMCV. IHCANFNTLFATT I PGTAF 
■ Replicase 1b — — ■ — 



GGTCCACTATGTCGTAAAGTTTTTATAGATGGTGTTCCACTTGTTACAACTGCTGGTTATCATTTTAAGCAATTAGGTTTGGTTTGGAATAA 

13432

CCAGGTGATACAGCATTTCAAAAATATCTACCACAAGGTGAACAATGTTGACGACCAATAGTAAAATTCGTTAATCCAAACCAAACCTTATT 

GPLC RKVF I DGVPLVTTAGYHFKOLGLVWNK 
Replicase 1b 



AGATGTTAACACACACTCAGTTAGGTTGACAATCACTGAACTTTTGCAATTTGTTACTGACCCTTCCTTGATAATAGCTTCTTCTCCAGCAC 

13524

TCTACAATTGTGTGTGAGTCAATCCAACTGTTAGTGACTTGAAAACGTTAAACAATGACTGGGAAGGAACTATTATCGAAGAAGAGGTCGTG

DVNTHSVRLT 1 TELLOFVTOPSL 1 IASSPA 
— Replicase 1b ■ — : '• t - 



TCGTTGATCAACGCACTATTTGTTTTTCTGTTGCAGCATTGAGTACTGGTTTGACAAATCAAGTTGTTAAGCCAGGTCATTTTAATGAAGAG 

13616

AGCAACTAGTTGCGTGATAAACAAAAAGACAACGTCGTAACTCATGACCAAACTGTTTAGTTCAACAATTCGGTCCAGTAAAATTACTTCTC 

LVDQRT I - C FSVAALSTGLTNQVVKPGHF NEE 
Replicase 1b ; 



TTTTATAACTTTCTTCGTTTAAGAGGTTTCTTTGATGAAGGTTCTGAACTTACATTAAAACATTTCTTCTTCGCACAGAATGGTGATGCTGC 

13708

AAAATATTGAAAGAAGCAAATTCTCCAAAGAAACTACTTCCAAGACTTGAATGTAATTTTGTAAAGAAGAAGCGTGTCTTACCACTACGACG 

FYNFLRLRGFFOEGSELTLKHFF FAQNGOAA 
— Replicase 1b 



TGTTAAAGATTTTGACTTTTACCGTTATAATAAGCCTACCATTTTAGATATTTGTCAAGCTAGAGTTACATATAAGATAGTCTCTCGTTATT 

13800

ACAATTTCTAAAACTGAAAATGGCAATATTATTCGGATGGTAAAATCTATAAACAGTTCGATCTCAATGTATATTCTATCAGAGAGCAATAA 

VKOFDFYRYNKPT IL01CQARVTYK I VSRY 
■ Replicase 1b 



TTGACATTTATGAAGGTGGCTGTATTAAGGCATGTGAAGTTGTTGTAACAAATCTTAATAAGAGTGCTGGTTGGCCATTAAATAAGTTTGGT 

13892

AACTGTAAATACTTCCACCGACATAATTCCGTACACTTCAACAACATTGTTTAGAATTATTCTCACGACCAACCGGTAATTTATTCAAACCA 

FO I YEGGC I KACEVVVTNLNKSAGWPLNKFG 
— Replicase 1b 






AAAGCTAGTTTGTATTACGAATCTATATCTTATGAAGAACAGGATGCTTTGTTTGCTTTGACAAAGCGTAATGTCCTCCCTACTATGACACA 

13984

TTTCGATCAAACATAATGCTTAGATATAGAATACTTCTTGTCCTACGAAACAAACGAAACTGTTTCGCATTACAGGAGGGATGATACTGTGT

KA'SLYYES ISYEEQOALFALTKRNVIPTMTO 
Replicase 1b — ■ 



GCTGAATCTTAAGTATGCTATTAGTGGTAAAGAACGTGCTAGAACTGTTGGTGGTGTTTCTCTGTTGTCCACAATGACCACAAGACAATACC 

14076

CGACTTAGAATTCATACGATAATCACCATTTCTTGCACGATCTTGACAACCACCACAAAGAGACAACAGGTGTTACTGGTGTTCTGTTATGG 

LNLK YA I SGKERARTVGGVSILSTMTTRQY 
: — Replicase 1b 



ATCAAAAACATCTTAAATCCATTGTTAATACACGCAATGCCACTGTTGTTATTGGTACTACCAAATTTTATGGTGGTTGGAATAATATGTTG 

14168

TAGTTTTTGTAGAATTTAGGTAACAATTATGTGCGTTACGGTGACAACAATAACCATGATGGTTTAAAATACCACCAACCTTATTATACAAC 

HQKHLKS I VNTRNATVV 1 GTTKFYGGWNNML 
; Replicase 1b 



CGTACTTTAATTGATGGTGTTGAAAACCCTATGCTCATGGGTTGGGATTATCCCAAATGTGATAGAGCTTTGCCTAACATGATACGTATGAT 

14260

GCATGAAATTAACTACCACAACTTTTGGGATACGAGTACCCAACCCTAATAGGGTTTACACTATCTCGAAACGGATTGTACTATGCATACTA 

RTL 1 DG.VENPHLHGWDYPKCORALPNM I R M 1 
: Replicase 1b 



TTCAGCCATGGTGTTGGGTTCTAAGCATGTTAATTGTTGTACTGTAACAGATAGGTTTTATAGGCTTGGTAACGAGTTGGCACAAGTTTTAA 

14352

AAGTCGGTACCACAACCCAAGATTCGTACAATTAACAACATGACATTGTCTATCCAAAATATCCGAACCATTGCTCAACCGTGTTCAAAATT

SAMVIGSKHVNCCTVT ORFYRLGNELAOVL 
Replicase 1b



CAGAAGTTGTTTATTCTAATGGTGGTTTTTATTTTAAGCCAGGTGGTACGACTTCTGGTGACGCTAGTACAGCTTATGCTAATTCTATTTTT

14444

GTCTTCAACAAATAAGATTACCACCAAAAATAAAATTCGGTCCACCATGCTGAAGACCACTGCGATCATGTCGAATACGATTAAGATAAAAA 

TEVVYSNGGFYFKPGGTTSG DASTAYANS I F 
Replicase 1b 



AACATTTTTCAAGCCGTGAGTTCTAACATTAACAGGTTGCTTAGTGTCCCATCAGATTCATGTAATAATGTTAATGTTAGGGATCTACAACG 

14536

TTGTAAAAAGTTCGGCACTCAAGATTGTAATTGTCCAACGAATCACAGGGTAGTCTAAGTACATTATTACAATTACAATCCCTAGATGTTGC 

NIFOAVSSNINRLLSVPSOSCNNVNVROLQR 
: Replicase 1b 



ACGTCTGTATGATAATTGCTATAGGTTAACTAGTGTTGAAGAGTCATTCATTGATGATTATTATGGTTATCTTAGGAAACATTTTTCAATGA 

14628

TGCAGACATACTATTAACGATATCCAATTGATCACAACTTCTCAGTAAGTAACTACTAATAATACCAATAGAATCCTTTGTAAAAAGTTACT 

RLYONCYRLTSVEESFIODYYGYLRKHFSM 
Replicase 1b 



TGATTCTCTCTGATGACGGTGTTGTCTGTTATAACAAGGATTATGCTGAGTTAGGTTATATAGCAGACATTAGTGCTTTTAAAGCCACTTTG 

14720

ACTAAGAGAGACTACTGCCACAACAGACAATATTGTTCCTAATACGACTCAATCCAATATATCGTCTGTAATCACGAAAATTTCGGTGAAAC 



Ml LSOOGVVCYNKO YAELGY IAO ISAFKATL 
— Replicase 1b 






TATTACCAGAATAATGTCTTTATGAGTACTTCTAAATGTTGGGTTGAAGAAGATTTAACTAAGGGACCACATGAGTTTTGTTCCCAGCATAC 

14812

ATAATGGTCTTATTACAGAAATACTCATGAAGATTTACAACCCAACTTCTTCTAAATTGATTCCCTGGTGTACTCAAAACAAGGGTCGTATG 

YY'O-NNVFHSTSKCWVEEOLTKGPHEFCSOHT 
Replicase 1b



TATGCAAATAGTTGATAAAGATGGTACCTATTATTTGCCTTACCCAGATCCTAGTAGGATCTTGTCAGCTGGTGTTTTTGTTGATGATGTTG 

14904

ATACGTTTATCAACTATTTCTACCATGGATAATAAACGGAATGGGTCTAGGATCATCCTAGAACAGTCGACCACAAAAACAACTACTACAAC



HQ! VOKOGTYYLPYPOPSRILSAG'VFVODV 
Replicase 1b 



TTAAGACAGATGCTGTTGTTTTGTTAKAACGTTATGTGTCTTTAGCTATTGATGCATACCCTCTTTCAAAACACCCTAATTCTGAATATCGT

14996

AATTCTGTCTACGACAACAAAACAATMTTGCAATACACAGAAATCGATAACTACGTATGGGAGAAAGTTTTGTGGGATTAAGACTTATAGCA 

VKTDAVVL17RYVSLA I DAYPLSKHPNSEYR 

Replicase 1b 



AAGGTTTTTTACGTATTACTTGATTGGGTTAAGCATCTTAACAAAAATTTGAATGAGGGTGTTCTTGAATCTTTTTCTGTTACACTTCTTGA 

15088

TTCCAAAAAATGCATAATGAACTAACCCAATTCGTAGAATTGTTTTTAAACTTACTCCCACAAGAACTTAGAAAAAGACAATGTGAAGAACT 

KVFYVLLDWVKHLNKNLNEGVLESFSVTLLD 
: Replicase 1b ; - 



TAATCAAGAAGATAAGTTTTGGTGTGAAGATTTTTATGCTAGTATGTATGAAAATTCTACAATATTGCAAGCTGCTGGCTTATGTGTTGTTT 

15180

ATTAGTTCTTCTATTCAAAACCACACTTCTAAAAATACGATCATACATACTTTTAAGATGTTATAACGTTCGACGACCGAATACACAACAAA 

MOE DK FWC EOF YASHY'EN S T-ItQA.AGLCVV 
— Replicase 1b : 



GTGGTTCACAAACTGTTCTTCGTTGTGGTGATTGTCTGCGTAAGCCTATGTTGTGCACTAAATGTGCATATGATCATGTATTTGGTACCGAC 

15272

CACCAAGTGTTTGACAAGAAGCAACACCACTAACAGACGCATTCGGATACAACACGTGATTTACACGTATACTAGTACATAAACCATGGCTG 

CGSOTVLRCGOCLRKPMLCTKCAYDHVFGTD 
Replicase 1b



CACAAGTTTATTTTGGCTATAACACCGTATGTATGTAATGCATCAGGTTGTGGTGTTAGTGATGTTAAAAAATTGTATCTTGGTGGTTTGAA 

15364

GTGTTCAAATAAAACCGATATTGTGGCATACATACATTACGTAGTCCAACACCACAATCACTACAATTTTTTAACATAGAACCACCAAACTT 

HKF I LA I TPYVCNASGCGVSDVKKIYLGGLN 
—Replicase 1b 



TTACTATTGTACAAATCATAAACCACAGTTGTCTTTTCCATTATGTTCTGCTGGTAATATATTTGGTTTATATAAAAATTCAGCAACTGGTT 

15456

AATGATAACATGTTTAGTATTTGGTGTCAACAGAAAAGGTAATACAAGACGACCATTATATAAACCAAATATATTTTTAAGTCGTTGACCAA 

YYCTNHKPGLSFPLCSAGN 1 FGLYKNSATG 

Replicase 1b • 



CCTTAGATGTTGAAGTTTTTAATAGGCTTGCAACGTCTGATTGGACTGATGTTAGGGACTATAAACTTGCTAATGATGTTAAAGATACACTT 

15548

GGAATCTACAACTTCAAAAATTATCCGAACGTTGCAGACTAACCTGACTACAATCCCTGATATTTGAACGATTACTACAATTTCTATGTGAA 

SLOVEVFNRLATSO WTOVRDYKLANOVKOTL 
. — Replicase 1b— 






AGACTCTTTGCGGCTGAAACTATTAAAGCTAAAGAAGAGAGTGTTAAGTCTTCTTATGCTTTTGCAACTCTTAAAGAGGTTGTTGGACCTAA 

15640

TCTGAGAAACGCCGACTTTGATAATTTCGATTTCTTCTCTCACAATTCAGAAGAATACGAAAACGTTGAGAATTTCTCCAACAACCTGGATT 

RLFAAE T I KAKEESVKSSYAFATLKEVV GPK 
Replicase 1b



AGAATTGCTTCTTAGTTGGGAAAGTGGTAAAGTTAAACCACCTTTGAATCGTAATTCTGTTTTCACCTGTTTTCAAATAAGTAAGGACTCAA 

15732

TCTTAACGAAGAATCAACCCTTTCACCATTTCAATTTGGTGGAAACTTAGCATTAAGACAAAAGTGGACAAAAGTTTATTCATTCCTGAGTT 

ELLLSWESGKVKPPLNRNSVFTCF'a ISKOS 
Replicase 1b



AATTCCAAATAGGTGAGTTCATCTTTGAAAAGGTTGAATATGGTTCTGATACTGTTACGTATAAGTCTACTGTAACCACTAAGTTAGTTCCT

15824

TTAAGGTTTATCCACTCAAGTAGAAACTTTTCCAACTTATACCAAGACTATGACAATGCATATTCAGATGACATTGGTGATTCAATCAAGGA 

KFQ'IGEF I FEKVEYGSDTVTYKSTVTTKLVP 
— ■ Replicase 1b 



GGTATGATTTTTGTCTTAACATCTCACAATGTTCAACCTTTACGTGCACCAACTATTGCAAACCAAGAGAAGTATTCTAGCATTTATAAATT 

15916

CCATACTAAAAACAGAATTGTAGAGTGTTACAAGTTGGAAATGCACGTGGTTGATAACGTTTGGTTCTCTTCATAAGATCGTAAATATTTAA 

GUI FVLTSHNVOPLRAPT IANQEKYSS1 YKL 
■ : Replicase 1b 



GCACCCTGCTTTTAATGTCAGTGATGCATATGCTAATTTGGTTCCATATTACCAACTTATTGGTAAACAAAAGATAACTACAATACAGGGTC 

16008

CGTGGGACGAAAATTACAGTCACTACGTATACGATTAAACCAAGGTATAATGGTTGAATAACCATTTGTTTTCTATTGATGTTATGTCCCAG 

HPAF N VS D AY ANLVPY Y 0 I I G KOK 1 TT 1 OG 
; Replicase 1b 



CTCCTGGTAGTGGTAAGTCACATTGTTCCATTGGACTTGGATTGTACTATCCAGGTGCGCGTATTGTTTTTGTTGCTTGTGCCCATGCTGCT 

16100

GAGGACCATCACCATTCAGTGTAACAAGGTAACCTGAACCTAACATGATAGGTCCACGCGCATAACAAAAACAACGAACACGGGTACGACGA 

PPGSGKSHCS I GLGLYYPGAR 1 VFVA CAHAA 
Replicase 1b 



GTTGATTCCTTATGTGCAAAAGCTATGACTGTTTATAGCATTGATAAGTGTACTAGGATTATACCTGCAAGAGCTCGGGTTGAGTGTTATAG 
16192

CAACTAAGGAATACACGTTTTCGATACTGACAAATATCGTAACTATTCACATGATCCTAATATGGACGTTCTCGAGCCCAACTCACAATATC 

VOSLCAK AMTVYS l DKCTR l I PARARVECYS 
: : Replicase 1b : 



TGGCTTTAAACCAAATAACACTAGTGCACAATACATATTTAGCACTGTTAACGCATTACCTGAGTGTAATGCTGATATTGTTGTTGTAGATG 

16284

ACCGAAATTTGGTTTATTGTGATCACGTGTTATGTATAAATCGTGACAATTGCGTAATGGACTCACATTACGACTATAACAACAACATCTAC 

GFKP NNTSAOY I FSTVNALPECNAO I VVVO 
Replicase 1b 



AAGTTTCAATGTGTACAAATTATGACCTTTCTGTTATTAATCAGCGTTTATCATATAAACATATTGTTTATGTTGGTGATCCACAACAACTT 

16376

TTCAAAGTTACACATGTTTAATACTGGAAAGACAATAATTAGTCGCAAATAGTATATTTGTATAACAAATACAACCACTAGGTGTTGTTGAA 

EVSMC TNYOtSV lNQRLSYKH IVYVG DPQQ L 

Replicase 1b






CCTGCACCTAGAGTAATGATTACTAAAGGTGTTATGGAGCCTGTTGATTATAACGTTGTTACTCAACGTATGTGTGCTATAGGCCCTGATGT

16468

GGACGTGGATCTCATTACTAATGATTTCCACAATACCTCGGACAACTAATATTGCAACAATGAGTTGCATACACACGATATCCGGGACTACA 

PA'PRVH I TKGVttEPVQYNVVTQRHCA I GPOV 
Replicase 1b

TTTTCTTCATAAATGTTATAGATGTCCTGCTGAAATAGTTAATACAGTTTCTGAACTTGTTTATGAGAACAAGTTTGTCCCTGTTAAACCTG 

16560

AAAAGAAGTATTTACAATATCTACAGGACGACTTTATCAATTATGTCAAAGACTTGAACAAATACTCTTGTTCAAACAGGGACAATTTGGAC 

FLHKCYRCPAEIVNT-VSELVYENKFVPVKP 
Replicase 1b



CTAGTAAACAGTGTTTTAAAATCTTTTTTAAGGGTAATGTACAGGTTGACAATGGCTCTAGTATTAACAGAAAGCAGCTTGAAATAGTTAAG
16652

GATCATTTGTCACAAAATTTTAGAAAAAATTCCCATTACATGTCCAACTGTTACCGAGATCATAATTGTCTTTCGTCGAACTTTATCAATTC 

ASKQCFK! FFKGNVOVONGSSINRKQLE I VK 
Replicase 1b



CTGTTTTTAGTTAAAAATCCAAGTTGGAGTAAGGCTGTGTTTATTTCTCCTTATAATAGTCAGAATTATGTTGCTAGTAGATTTTTAGGACT 

16744

GACAAAAATCAATTTTTAGGTTCAACCTCATTCCGACACAAATAAAGAGGAATATTATCAGTCTTAATACAACGATCATCTAAAAATCCTGA

LFLVK NPSWSKAVF I SPYNSONYVASRFLCL 
Replicase 1b



TCAAATTCAAACTGTTGATTCTTCTCAAGGTAGTGAGTATGATTATGTAATCTATGCACAAACTTCTGACACTGCACATGCTTGCAATGTAA 

16836

AGTTTAAGTTTGACAACTAAGAAGAGTTCCATCACTCATACTAATACATTAGATACGTGTTTGAAGACTGTGACGTGTACGAACGTTACATT 

01 QTV DSSQGSEYDYV1 YAQTSDTAH-ACNV 
Replicase 1b



ACCGTTTTAATGTTGCTATAACACGTGCTAAGAAGGGTATATTTTGTGTAATGTGTGATAAAACTTTGTTTGATTCACTTAAGTTTTTTGAG 

16928

TGGCAAAATTACAACGATATTGTGCACGATTCTTCCCATATAAAACACATTACACACTATTTTGAAACAAACTAAGTGAATTCAAAAAACTC 

NRFNV A I TRAKKG I FCVMCDKTLFDSIKFFE • 
-Replicase 1b — 

ATTAAACATGCAGATTTACACTCTAGCCAGGTTTGTGGCTTGTTTAAAAATTGTACACGCACTCCTCTTAATTTACCACCAACTCATGCACA 

17020

TAATTTGTACGTCTAAATGTGAGATCGGTCCAAACACCGAACAAATTTTTAACATGTGCGTGAGGAGAATTAAATGGTGGTTGAGTACGTGT

1 K HA D LH S SQVC.GLFK NCT R TPLNL PP T HAH 
Replicase 1b



CACTTTCTTGTCGTTGTCAGATCAGTTTAAGACTACAGGTGATTTAGCTGTTCAAATAGGTTCAAATAATGTTTGTACTTATGAACATGTTA 

17112

GTGAAAGAACAGCAACAGTCTAGTCAAATTCTGATGTCCACTAAATCGACAAGTTTATCCAAGTTTATTACAAACATGAATACTTGTACAAT 

TFL5LSDQFKTTG0LAVQIGSNNVCTYEHV 
Replicase 1b



TATCATTTATGGGTTTTAGGTTTGATATTAGTATTCCTGGTAGTCATAGTTTGTTTTGTACACGTGACTTTGCTATTCGTAATGTGCGTGGT

17204

ATAGTAAATACCCAAAATCCAAACTATAATCATAAGGACCATCAGTATCAAACAAAACATGTGCACTGAAACGATAAGCATTACACGCACCA

ISFMGFRFDI S IPG -SHS lFCTROFA I RNVRG- 
Replicase 1b






TGGTTGGGTATGGATGTTGAAAGTGCTCATGTTTGTGGCGATAACATAGGTACTAATGTTCCTTTACAGGTTGGTTTTTCAAATGGTGTTAA 

17296

ACCAACCCATACCTACAACTTTCACGAGTACAAACACCGCTATTGTATCCATGATTACAAGGAAATGTCCAACCAAAAAGTTTACCACAATT 

WLGNDVE SAHVCGQN l GTNVPLQ.VGF SNGVN 
Replicase 1b



TTTTGTTGTGCAAACTGAAGGTTGTGTGTCTACCAATTTTGGTGATGTTATTAAACCTGTTTGTGCAAAATCTCCACCAGGTGAACAATTTA 

17388

AAAACAACACGTTTGACTTCCAACACACAGATGGTTAAAACCACTACAATAATTTGGACAAACACGTTTTAGAGGTGGTCCACTTGTTAAAT 



FVVQTEGCVSTNFGDVIKPVCAKSPPGEOF 
Replicase 1b 



GACACCTTGTTCCTTTTTTACGTAAAGGACAACCTTGGTTAATTGTTCGTAGACGCATTGTGCAAATGATATCTGATTATTTGTCCAATTTG 

17480

CTGTGGAACAAGGAAAAAATGCATTTCCTGTTGGAACCAATTAACAAGCATCTGCGTAACACGTTTACTATAGACTAATAAACAGGTTAAAC 



RHLVPFLRKGQPWL IVRRR I V 0 M I SDYLSNL 

Replicase 1b 



TCTGACATTCTTGTCTTTGTTTTGTGGGCAGGTAGTTTGGAATTAACTACAATGCGTTACTTTGTAAAAATAGGGCCAATTAAATATTGTTA 

17572

AGACTGTAAGAACAGAAACAAAACACCCGTCCATCAAACCTTAATTGATGTTACGCAATGAAACATTTTTATCCCGGTTAATTTATAACAAT 

SDILVFVLWAGSLELTTMRYFVKIGPIKYCY 
— Replicase 1b ■ 



TTGTGGTAATTCTGCCACTTGTTATAATTCAGTTAGTAATGAATATTGTTGTTTTAAACATGCATTGGGTTGTGATTATGTTTACAATCCGT 

17664

AACACCATTAAGACGGTGAACAATATTAAGTCAATCATTACTTATAACAACAAAATTTGTACGTAACCCAACACTAATACAAATGTTAGGCA

CGNSATCYNSVSNEYCCFKHALGCDYVYNP 
; : Replicase 1b ! 



ATGCTTTTGATATACAACAGTGGGGTTATGTTGGTTCCTTGAGCCAGAACCACCACACGTTCTGTAACATTCATAGAAACGAGCATGATGCT 

17756

TACGAAAACTATATGTTGTCACCCCAATACAACCAAGGAACTCGGTCTTGGTGGTGTGCAAGACATTGTAAGTATCTTTGCTCGTACTACGA

YAFO I QQWGY. VGS LSQNHHTFCN I H.RN EHOA 
Replicase 1b ! 



TCTGGTGATGCTGTTATGACACGTTGTTTGGCAGTACATGATTGTTTTGTCAAAAATGTTGATTGGACTGTAACGTACCCCTTTATTGCAAA 

17848

AGACCACTACGACAATACTGTGCAACAAACCGTCATGTACTAACAAAACAGTTTTTACAACTAACCTGACATTGCATGGGGAAATAACGTTT 

SGOAVMTRCLAVHDCFVKNVDWTVTYPF IAN 
Replicase 1b : 



TGAGAAATTTATCAATGGCTGTGGGCGTAATGTCCAGGGACATGTTGTTCGCGCAGCCTTGAAATTGTATAAACCTAGTGTTATTCATGATA

17940

ACTCTTTAAATAGTTACCGACACCCGCATTACAGGTCCCTGTACAACAAGCGCGTCGGAACTTTAACATATTTGGATCACAATAAGTACTAT 

EKF I NGCGRNVQGHVVRAA-LKLYKPSV I HO 

Replicase 1b . 



TTGGTAATCCTAAAGGTGTACGTTGTGCTGTTACTGATGCCAAATGGTACTGTTATGACAAGCAACCTGTTAATAGTAATGTCAAGTTGTTG 

18032

AACCATTAGGATTTCCACATGCAACACGACAATGACTACGGTTTACCATGACAATACTGTTCGTTGGACAATTATCATTACAGTTCAACAAC 



IGNPK GVRCAVTDAKWYCYDKQPVN5NVKLL 
Replicase 1b ; 






GATTATGATTATGCAACCCATGGTCAACTTGATGGTCTTTGTTTATTCTGGAATTGTAATGTTGATATGTATCCAGAATTTTCAATTGTGTG 

18124

CTAATACTAATACGTTGGGTACCAGTTGAACTACCAGAAACAAATAAGACCTTAACATTACAACTATACATAGGTCTTAAAAGTTAACACAC 

QYOYATHGQL DGLCLFWNCNVOMYPEFSIVC 
Replicase 1b

TCGCTTTGACACACGTACTCGTTCTGTTTTTAATTTAGAAGGTGTTAATGGTGGTTCTCTTTATGTTAACAAACATGCGTTTCATACACCAG 

18216

AGCGAAACTGTGTGCATGAGCAAGACAAAAATTAAATCTTCCACAATTACCACCAAGAGAAATACAATTGTTTGTACGCAAAGTATGTGGTC 

RFOT R TRSVFNLEGVNGGSLYVNKHArHTP 
. Replicase 1b- — — 



CATATGATAAACGTGCTTTTGTTAAATTAAAACCTATGCCCTTTTTTTACTTTGATGACAGTGATTGTGATGTTGTGCAAGAACAAGTTAAT 

18308

GTATACTATTTGCACGAAAACAATTTAATTTTGGATACGGGAAAAAAATGAAACTACTGTCACTAACACTACAACACGTTCTTGTTCAATTA 

A. YDKRAFVKLKPMPFFYFODSDCDVV. QEQVN 
—Replicase 1b 



TATGTACCCCTTCGCGCTAGTAGTTGTGTTACCCGTTGTAATATAGGTGGTGCTGTTTGTTCAAAACATGCAAATTTGTATCAAAAATATGT 

18400

ATACATGGGGAAGCGCGATCATCAACACAATGGGCAACATTATATCCACCACGACAAACAAGTTTTGTACGTTTAAACATAGTTTTTATACA 

YVPLRASSCVTRCNI GGAVCSKHANLYOKYV 
Replicase 1b



TGAGGCATATAATACATTTACACAGGCTGGTTTTAACATTTGGGTACCACATAGTTTTGATGTTTATAATTTGTGGCAAATTTTTATTGAAA 
18492

ACTCCGTATATTATGTAAATGTGTCCGACCAAAATTGTAAACCCATGGTGTATCAAAACTACAAATATTAAACACCGTTTAAAAATAACTTT
EAYNTFTQAGFNIWVPHSFDVYNLWQIFIE

Replicase 1b



CTAATTTACAAAGTCTTGAAAATATAGCATTTAATGTTGTAAAAAAAGGGTGTTTTACTGGTGTTGATGGTGAGTTACCTGTTGCAGTTGTT

18584

GATTAAATGTTTCAGAACTTTTATATCGTAAATTACAACATTTTTTTCCCACAAAATGACCACAACTACCACTCAATGGACAACGTCAACAA

TNLQSLENIAFNVVKKGCFTGVDGELPVAVV
—Replicase 1b 



AACGACAAAGTTTTTGTTCGCTATGGCGATGTTGACAACTTGGTTTTTACAAATAAAACAACATTGCCTACTAATGTTGCTTTTGAATTGTT

18676

TTGCTGTTTCAAAAACAAGCGATACCGCTACAACTGTTGAACCAAAAATGTTTATTTTGTTGTAACGGATGATTACAACGAAAACTTAACAA

NDKVFVRYGDVDNLVFTNKTTLPTNVAFELF
Replicase 1b



TGCAAAACGAAAAATGGGTTTAACACCACCATTGTCTATTCTCAAAAATCTTGGTGTTGTTGCTACATATAAATTTGTTTTATGGGATTATG 

18768

ACGTTTTGCTTTTTACCCAAATTGTGGTGGTAACAGATAAGAGTTTTTAGAACCACAACAACGATGTATATTTAAACAAAATACCCTAATAC 

AKRKMGL TPPLS I LKNLGVVATYKFVLWDY 
— — — Replicase 1b • 



AAGCTGAAAGACCTTTTACCTCATATACTAAGAGTGTATGTAAATACACTGATTTTAATGAGGATGTTTGTGTTTGTTTTGACAATAGTATT 

18860

TTCGACTTTCTGGAAAATGGAGTATATGATTCTCACATACATTTATGTGACTAAAATTACTCCTACAAACACAAACAAAACTGTTATCATAA 

EAERPFTSYTKSVCKYTOFNEOVCVCFONSI 
Replicase 1b






CAGGGTTCGTATGAGCGTTTTACGCTTACTACGAACGCTGTTTTATTTTCTACTGTTGTCATTAAAAATTTAACACCTATAAAGTTGAATTT 

18952

GTCCCAAGCATACTCGCAAAATGCGAATGATGCTTGCGACAAAATAAAAGATGACAACAGTAATTTTTAAATTGTGGATATTTCAACTTAAA 

QGSYERFTLTTNAVLFSTVVIKNLTPIKLNF 
1 Replicase 1b 



TGGTATGTTGAATGGTATGCCAGTTTCTTCTATTAAGAGTGATAAAGGTGTTGAAAAATTAGTTAATTGGTACACATATGTTCGTAAAAATG

19044

ACCATACAACTTACCATACGGTCAAAGAAGATAATTCTCACTATTTCCACAACTTTTTAATCAATTAACCATGTGTATACAAGCATTTTTAC 



GMLNGMPVSS 1 KSDKGVEKLVNWYTYVRKN 
Replicase 1b 



GTCAATTTCAAGATCATTATGATGGTTTTTACACTCAAGGTAGGAATTTATCAGACTTTACACCAAGAAGTGATATGGAGTATGATTTTCTT 

19136

CAGTTAAAGTTCTAGTAATACTACCAAAAATGTGAGTTCCATCCTTAAATAGTCTGAAATGTGGTTCTTCACTATACCTCATACTAAAAGAA 

GOFQDHYDGF Y TOGRNLSOFTPRSDMEYDFL 
— Replicase 1b 



AACATGGATATGGGTGTTTTTATTAATAAATATGGTCTTGAGGATTTTAATTTTGAACATGTTGTATATGGTGATGTTTCAAAAACTACATT

19228

TTGTACCTATACCCACAAAAATAATTATTTATACCAGAACTCCTAAAATTAAAACTTGTACAACATATACCACTACAAAGTTTTTGATGTAA 

NMDMGVF 1 NKYGLEOFNFEHVVYGDVSK TTL 
— Replicase 1b 



AGGAGGTCTTCATTTGTTGATATCACAGTTTAGGCTTAGTAAAATGGGTGTTTTGAAAGCTGATGATTTTGTCACTGCTTCTGACACAACTT 

19320

TCCTCCAGAAGTAAACAACTATAGTGTCAAATCCGAATCATTTTACCCACAAAACTTTCGACTACTAAAACAGTGACGAAGACTGTGTTGAA 

GGLHLL ! SOFRLSKMGVL KADOFVTASDTT 
— — — Replicase 1b 



TGAGGTGCTGTACTGTTACTTATCTTAATGAACTTAGTTCAAAAGTTGTTTGTACTTATATGGATTTGTTGTTGGACGACTTTGTTACTATA 

, 1 1 ~H . H 1 1 — H h H 1 -H 1 1 1 1 19412 

ACTCCACGACATGACAATGAATAGAATTACTTGAATCAAGTTTTCAACAAACATGAATATACCTAAACAACAACCTGCTGAAACAATGATAT 

LRCCTVTYLNELSSKVVCTYMDLLLDDFVTI
1 Replicase 1b •■ — 



CTAAAGAGTTTAGATCTTGGTGTAATATCTAAAGTTCATGAAGTTATTATAGATAATAAACCTTATAGGTGGATGTTGTGGTGTAAAGATAA 

^ , < 1 I I 1 h h H • i h I I 19504 

GATTTCTCAAATCTAGAACCACATTATAGATTTCAAGTACTTCAATAATATCTATTATTTGGAATATCCACCTACAACACCACATTTCTATT 

LKSLDLGVISKVHEVIIDNKPYRWMLWCKDN
: Replicase 1b — : : 1 



CCACTTGTCGACTTTTTATCCACAGTTGCAGTCTGCTGAATGGAAGTGTGGTTATGCTATGCCACAAATTTATAAGCTTCAACGTATGTGTT 

» ■ ' ■ I 1 1 • 1 1 ' I I ■ 1 i 1 > ' I I 1 1 1 i 1 19596 

GGTGAACAGCTGAAAAATAGGTGTCAACGTCAGACGACTTACCTTCACACCAATACGATACGGTGTTTAAATATTCGAAGTTGCATACACAA

HLSTFYPQLQSAEWKCGYAMPQIYKLQRMC
— Replicase 1b 



TGGAACCTTGTAATTTATATAATTATGGTGCTGGTATTAAGTTGCCTAGTGGTATAATGTTAAATGTTGTTAAATACACTCAGCTTTGTCAA 

1 ■ ■ , i >■■ -[ ( , j h— I ~* 1 ~h 1 1 H « 1 19688 

ACCTTGGAACATTAAATATATTAATACCACGACCATAATTCAACGGATCACCATATTACAATTTACAACAATTTATGTGAGTCGAAACAGTT

LEPCNLYNYGAGIKLPSGIMLNVVKYTQLCQ
Replicase 1b 



TACCTAAATAGCACTACAATGTGCGTACCTCATAATATGCGTGTTTTGCACTATGGTGCTGGTTCTGACAAAGGTGTGGCACCTGGTACAAC

H r-« 1 1 ! > ■ ■ 1 • 1 1 I t I 1 1 1 » ■ ■ t 19780 

ATGGATTTATCGTGATGTTACACGCATGGAGTATTATACGCACAAAACGTGATACCACGACCAAGACTGTTTCCACACCGTGGACCATGTTG 

YLNSTTMCVPHNMRVLHYGAGSDKGVAPGTT
Replicase 1b



TGTTTTAAAACGTTGGCTACCACCTGATGCAATAATCATTGATAATGATATCAATGATTATGTTAGTGATGCAGATTTTAGCATTACAGGTG 

1 , ~| , 1 1 1 , 1 h 1 I 1 — 19872 

ACAAAATTTTGCAACCGATGGTGGACTACGTTATTAGTAACTATTACTATAGTTACTAATACAATCACTACGTCTAAAATCGTAATGTCCAC 

VLKRWLPPDAIIIDNDINDYVSDADFSITG
Replicase 1b



ATTGTGCTACTGTTTACCTTGAAGATAAGTTTGACTTACTTATTTCTGATATGTATGATGGTAGAATTAAATTTTGTGATGGTGAAAACGTC 

— < H 1 h 1 ! " 1 =1 1 1 «~ 1 1 1 1 1 19964 

TAACACGATGACAAATGGAACTTCTATTCAAACTGAATGAATAAAGACTATACATACTACCATCTTAATTTAAAACACTACCACTTTTGCAG 

DCATVYLEDKFDLLISDMYDGRIKFCDGENV
■ Replicase 1b 



TCTAAAGATGGTTTTTTTACTTATCTTAATGGTGTTATTAGAGAAAAATTAGCTATTGGTGGTAGTGTTGCCATTAAGATTACAGAATATAG 

> ■ ■ | 1 1 1 h ■ ■ i 1 ■ ■ ■ i 1 1 1 1 h~ I 1 i 20056 

AGATTTCTACCAAAAAAATGAATAGAATTACCACAATAATCTCTTTTTAATCGATAACCACCATCACAACGGTAATTCTAATGTCTTATATC 

SKDGFFTYLNGVIREKLAIGGSVAIKITEYS

Replicase 1b 



TTGGAATAAGTATCTTTATGAATTAATACAAAGATTTGCTTTTTGGACTTTGTTCTGCACGTCTGTTAATACATCCTCTTCAGAAGCTTTTC 

! 1 1 ■ i 1 1 1 1 1 , ■ ■ i I h 1 1 HH 1 1 20148 

AACCTTATTCATAGAAATACTTAATTATGTTTCTAAACGAAAAACCTGAAACAAGACGTGCAGACAATTATGTAGGAGAAGTCTTCGAAAAG 

WNKYLYELIQRFAFWTLFCTSVNTSSSEAF
— ■ Replicase 1b— — — 



TTATTGGTATTAATTATTTAGGTGACTTTATTCAAGGTCCTTTTATAGCTGGTAACACTGTTCATGCTAATTATATATTTTGGCGTAATTCT 

H . 1 1 1 1 1 1 H 1 H h 1 h 1 1 1 1 H 20240 

AATAACCATAATTAATAAATCCACTGAAATAAGTTCCAGGAAAATATCGACCATTGTGACAAGTACGATTAATATATAAAACCGCATTAAGA 

LIGINYLGDFIQGPFIAGNTVHANYIFWRNS
■ Replicase 1b 



ACTATTATGTCTTTGTCATACAATTCAGTTTTAGATTTAAGTAAGTTTGAATGTAAACATAAGGCCACTGTTGTTGTTACACTTAAAGATAG 

— _ 1 . 1 i I 1 " I I i-~H 1 ! 1 I ■ 1 — 20332 

TGATAATACAGAAACAGTATGTTAAGTCAAAATCTAAATTCATTCAAACTTACATTTGTATTCCGGTGACAACAACAATGTGAATTTCTATC

TIMSLSYNSVLDLSKFECKHKATVVVTLKDS
— Replicase 1b : 



TGATGTAAATGATATGGTTTTGAGTTTGATTAAGAGTGGTAGGTTGTTGTTACGTAATAGTGGCCGTTTTGGTGGTTTTAGTAATCATTTAG 

— , 1 1 1 . 1 1 — ~h In 1 h 1 1 I ■ i ' ' -4 20424 

ACTACATTTACTATACCAAAACTCAAACTAATTCTCACCATCCAACAACAATGCATTATCACCGGCAAAACCACCAAAATCATTAGTAAATC 

DVNDMVLSLIKSGRLLLRNSGRFGGFSNHL
Replicase 1b ■ 






TCTCAACTAAATGAAACTTTTCTTGATTTTGCTTATTTTGCCCCTGGTTTCTTGCTTTTCTACATGTAACAGTAATGCTAGTATTTCTATGT 

■* j ' 1 ' H " 1 1 f— h \ 1 1 • 1 >— H »- 20516 

AGAGTTGATTTACTTTGAAAAGAACTAAAACGAATAAAACGGGGACCAAAGAACGAAAAGATGTACATTGTCATTACGATCATAAAGATACA 

MKLFLILLILPLVSCFSTCNSNASISM
Spike

VSTK

Replicase 1b

TACAATTAGGTGTTCCTGATAACTCTTCAACTATTGTCACAGGTTTGTTGCCAGTCCATTGGATTTGTGCTAATCAGAGTACATCTAGTTAC

i 1 1 H 1 1 • 1 1 H • 1— 1 1 20608 

ATGTTAATCCACAAGGACTATTGAGAAGTTGATAACAGTGTCCAAACAACGGTCAGGTAACCTAAACACGATTAGTCTCATGTAGATCAATG

LOLGVPONSSTIVTGLLPVHWICANQSTSSY 
■ Spike 



CCAGCCAACGGCTTTTTCTATATTGATGTTGGTAAACACCGTAGTGCCTTTGCACTCCATAGTGGTTATTATGATGCTAACCAGTATTATAT 

1 i 1 . -H i I 1 1 ~H ■ ■ 1 1 ■ 1 ■ ' 1 h 20700 

GGTCGGTTGCCGAAAAAGATATAACTACAACCATTTGTGGCATCACGGAAACGTGAGGTATCACCAATAATACTACGATTGGTCATAATATA 

PANGFFY I OVGKHRSAFALHSGYYDANQYY I 
Spike 



TTATCTCACTAATAAAATACATTTAAATGCTCCTGTCACTCTGAAGATTTGTAAGTTTGGAAACACTTCTTTTGATTTTTTAAGTAATGTTT 

i. l i I i I I ■ ■ i ■ I i I I I I 20792 

AATAGAGTGATTATTTTATGTAAATTTACGAGGACAGTGAGACTTCTAAACATTCAAACCTTTGTGAAGAAAACTAAAAAATTCATTACAAA 

YLTNK I HLNAPVTLK I CKFGNTSF. OFLSNV 
Spike 



CTACTTCTCATGATTGTATAGTTAATTTGTCATTCACAGAACAGTTAGGTGTGCCTTTGGGCATAACTATATCGGGTGAAACTGTACGTTTG 

— , 1 i ■ i i i 1 1 H 1 j h-hH h~H I I ■ i H 20884 

GATGAAGAGTACTAACATATCAATTAAACAGTAAGTGTCTTGTCAATCCACACGGAAACCCGTATTGATATAGCCCACTTTGACATGCAAAC 

STSHOC I VNLSFTEOLGVPLG I T I SGE TVRL 
. ■ : Spike : : 



CATTTATATAATGCAACTCGTACTTTTTATGTGCCGGCCGCTTATAAACTTACTAAACTTAGTGTTAAATGTTACTTTAGTGAATCCTGTGT 

h h-H 1 1 h 1 hn 1— 1 h I 1 I 20976 

GTAAATATATTACGTTGAGCATGAAAAATACACGGCCGGCGAATATTTGAATGATTTGAATCACAATTTACAATGAAATCACTTAGGACACA 

HIYNATRTFYVPAAYKITKLSVKCYFSESCV 
Spike 



TTTTAGTGTTGTCAATGCCACCATTACTGTTAATGTCACCACACTTAATGGCCGTATAGTTAACTACACTGTTTGTGATGATTGTAATGGTT

j , 1 1 1 1 1 1 ■ ■ ■ I 1 1 1 » : — 21068 

AAAATCACAACAGTTACGGTGGTAATGACAATTACAGTGGTGTGAATTACCGGCATATCAATTGATGTGACAAACACTACTAACATTACCAA 

FSVVNAT I TVNVTTLNGR I VNYTVCOOCNG 
— Spike ; 



ATACTGATAACATATTTTCTGTTCAACAGGATGGCCGCATTCCTAATGGTTTCCCTTTTAATAATTGGTTTTTGTTAACTAATGGTTCCACA 

_t , 1 1 1 . 1 >— - \ 1 i-H h 1 1 1 i I ' — + 21160 

TATGACTATTGTATAAAAGACAAGTTGTCCTACCGGCGTAAGGATTACCAAAGGGAAAATTATTAACCAAAAACAATTGATTACCAAGGTGT 

YTONIFSVQODGRIPNGFPFNNWFLLTNGST 
Spike : 






TTAGTGGACGGGGTCTCTAGACTTTATCAACCACTCCGTTTAACTTGTTTATGGCCTGTACCTGGTCTTAAATCTTCAACTGGTTTTGTTTA

' 1 1 1 1 ' 1 1 1 1 i ■ ' I i i , | 21252 
AATCACCTGCCCCAGAGATCTGAAATAGTTGGTGAGGCAAATTGAACAAATACCGGACATGGACCAGAATTTAGAAGTTGACCAAAACAAAT 

LVOGVSRLYQ P LRLTCLWPV PGLK SS TGFVY 
" ~ Spike ; ■ — 

TTTTAATGCCACTGGTTCTGATGTTAATTGTAACGGCTATCAACATAATTCTGTTGCTGATGTTATGCGTTACAATCTTAACCTCAGTGCTA 

— 1 ' 1 1 1 1 1 1 ' 1 1 1 1 1 \ 1 21344 

AAAATTACGGTGACCAAGACTACAATTAACATTGCCGATAGTTGTATTAAGACAACGACTACAATACGCAATGTTAGAATTGGAGTCACGAT

F N A T G S D VNCNGYQHNS VADVMRYNLNLSA 
; ! • Spike - . _ 



ATTCTGTGGACAATCTTAAGAGTGGTGTTATAGTTTTTAAAACTTTACAGTACGATGTTTTGTTTTATTGTAGTAATTCTTCTTCAGGTGTT 

■* ~* 1 1 1 1 ' 1 K—h 1 1— H 1 H 1 H ~— 21436 

TAAGACACCTGTTAGAATTCTCACCACAATATCAAAAATTTTGAAATGTCATGCTACAAAACAAAATAACATCATTAAGAAGAAGTCCACAA 

N 5 V 0 N LKSGVIVFKTLOYOVLFYCSN.SSS GV 
■ Spike ' • : 



CTTGACACCACAATACCTTTTGGCCCTTCCTCTCAACCTTATTACTGTTTTATAAACAGTACTATCAACACTACTCATGTTAGCACTTTTGT

' ' ' * ' 1 ' 1 1 1 1 1 11 ' ' 1 1 I i ■ I ■ i | i ■ | i ' 21528 

GAACTGTGGTGTTATGGAAAACCGGGAAGGAGAGTTGGAATAATGACAAAATATTTGTCATGATAGTTGTGATGAGTACAATCGTGAAAACA 

L 0 T T I PFGPSSQPYYCFINSTINTTHVSTFV 
~ Spike : 

GGGTATTTTACCACCCACTGTGCGTGAAATTGTTGTTGCTAGAACTGGTCAGTTTTATATTAATGGTTTTAAGTATTTCGATTTGGGTTTCA 

1 1 1 1 I h 1 i ^ h— H 1— *-\ 1 h 21620 

CCCATAAAATGGTGGGTGACACGCACTTTAACAACAACGATCTTGACCAGTCAAAATATAATTACCAAAATTCATAAAGCTAAACCCAAAGT 

_GIL_P P TVREIVVART-GQFY INGF'KYFDLGF 
; Spike 1 



TAGAAGCTGTCAATTTTAATGTCACGACTGCTAGTGCCACAGATTTTTGGACGGTTGCATTTGCTACTTTTGTTGATGTTTTGGTTAATGTT 

— h 1 " h 1 ■ ■ • ■ ■ I i ■ ■ i 1 . 1 1 .... | 1 | , 1 — 21712 

ATCTTCGACAGTTAAAATTACAGTGCTGACGATCACGGTGTCTAAAAACCTGCCAACGTAAACGATGAAAACAACTACAAAACCAATTACAA 

[ E A V N F N VTTASATDFWTVAFATFVOVLVNV 
' Spike ■ — — 



AGTGCAACTAACATTCAAAACTTACTTTATTGCGATTCTCCATTTGAAAAGTTGCAGTGTGAGCACTTGCAGTTTGGATTGCAAGATGGTTT

I 1 1 1 "I H 1 i 1 h ■ | i i. i 1 ■ l 1 21804 

TCACGTTGATTGTAAGTTTTGAATGAAATAACGCTAAGAGGTAAACTTTTCAACGTCACACTCGTGAACGTCAAACCTAACGTTCTACCAAA 

s A T N 1 Q N L LYCOSPFEKLQCEHLQFG LQOGF 
— Spike — 

TTATTCTGCAAATTTTCTTGATGATAATGTTTTGCCTGAGACTTATGTTGCACTCCCCATTTATTATCAACATACGGACATAAATTTTACTG

i > i 1 ■ 1 ' ■ ' \ | - ) f i { | 21896 

AATAAGACGTTTAAAAGAACTACTATTACAAAACGGACTCTGAATACAACGTGAGGGGTAAATAATAGTTGTATGCCTGTATTTAAAATGAC 

Y S A N F L ODNVLP ETYVALP 1YY0HT0 ! NFT 
~ ' — Spike 



CAACTGCATCTTTTGGTGGTTCTTGTTATGTTTGTAAACCACGCCAGGTTAATATATCTCTTAATGGTAACACTTCAGTGTGTGTTAGAACA 

1 ' 1 I 1 1 1 ■ ' 1 ' 1 i i . ■ i , | . j i i i i i | | 21988 

GTTGACGTAGAAAACCACCAAGAACAATACAAACATTTGGTGCGGTCCAATTATATAGAGAATTACCATTGTGAAGTCACACACAATCTTGT 

ATA5FGGS CYVCKPR0VN1 SLNGNTSVCVRT 
Spike 



TCTCATTTTTCAATTAGGTATATTTATAACCGCGTTAAGAGTGGTTCACCAGGTGACTCTTCATGGCATATTTATTTAAAGAGTGGCACTTG

H 1 1 • 1 ' ■ i ■ 1 p 1 . i i 1 i 1 1 1 —f 22080 

AGAGTAAAAAGTTAATCCATATAAATATTGGCGCAATTCTCACCAAGTGGTCCACTGAGAAGTACCGTATAAATAAATTTCTCACCGTGAAC 

SHFSIRYIYNRVKSGSPG O S S W H I Y L K S G T C 
Spike ; 



TCCATTTTCTTTTTCTAAGTTAAATAATTTTCAAAAGTTTAAGACTATTTGTTTCTCAACCGTCGAAGTGCCTGGTAGTTGTAATTTTCCAC

I ' I ' 1 H • 1 ' I ' H > 1 1 1 — 22172 

AGGTAAAAGAAAAAGATTCAATTTATTAAAAGTTTTCAAATTCTGATAAACAAAGAGTTGGCAGCTTCACGGACCATCAACATTAAAAGGTG 

PFSF SKLNNFOKFKT I CF6TVEVPGSCNFP 
Spike 



TTGAAGCCACCTGGCATTACACTTCTTATACTATTGTTGGTGCTTTGTATGTTACTTGGTCTGAAGGTAATTCCATTACTGGTGTACCTTAT 

— i i 1 1 ■ ■ ■ i 1 1 1 1 1 1 1 1 ■ >. , .■■! l ■ ■ ■ 22264 

AACTTCGGTGGACCGTAATGTGAAGAATATGATAACAACCACGAAACATACAATGAACCAGACTTCCATTAAGGTAATGACCACATGGAATA 

LEATWHYTSY'T IVGALYVTWSEGNSI TGVPY 
Spike : 



CCTGTCTCTGGTATTCGTGAGTTTAGTAATTTAGTTTTAAATAATTGTACCAAATATAATATTTATGATTATGTTGGTACTGGAATTATACG

h 1 1 1 1 1 — i 1 h 1 h ! i i 1 1 -H »~ 22356 

GGACAGAGACCATAAGCACTCAAATCATTAAATCAAAATTTATTAACATGGTTTATATTATAAATACTAATACAACCATGACCTTAATATGC 

PVSG I REFSNLVLNNCTKYN I YOYVG TG I I R 



TTCTTCAAACCAGTCACTTGCTGGTGGTATTACATATGTTTCTAACTCTGGTAATTTACTTGGTTTTAAAAATGTTTCCACTGGTAACATTT

— H ~h 1 1 H • 1 ■ ■ ■ i H M 1 \ — j— * H 1 22448 

AAGAAGTTTGGTCAGTGAACGACCACCATAATGTATACAAAGATTGAGACCATTAAATGAACCAAAATTTTTACAAAGGTGACCATTGTAAA 

SSNOSLAGGITYVSNSGNLLGFKNVSTGN! 
« ; Spike 1 : : ! 

TTATTGTGACACCATGTAACCAACCAGATCAAGTAGCTGTTTATCAACAAAGCATTATTGGTGCCATGACCGCTGTTAATGAGTCTAGATAT 

H 1 1 ' 1 >■■■■ ! ' . • 1 ~> H 1 1 ' ■ i • 'I 1 1~ h 22540 

AATAACACTGTGGTACATTGGTTGGTCTAGTTCATCGACAAATAGTTGTTTCGTAATAACCACGGTACTGGCGACAATTACTCAGATCTATA 

F 1 V TP C NOPDO VAVYOQS I IGAMTAVNESRY 

Spike 



GGCTTGCAAAACTTACTACAGTTACCTAACTTTTATTATGTTAGTAATGGTGGTAACAATTGCACTACGGCTGTTATGATTTATTCTAATTT 

■ ■ i ■ I I ) I 1 < I I", i | | I ■ ■ 22632 

CCGAACGTTTTGAATGATGTCAATGGATTGAAAATAATACAATCATTACCACCATTGTTAACGTGATGCCGACAATACTAAATAAGATTAAA 

GLONLLQLPNFYYVSNGGNNCTTAVMIYSNF 
Spike 1 



TGGTATTTGTGCTGATGGTTCTTTAATTCCTGTTCGTCCGCGTAATTCTAGTGATAATGGTATTTCAGCCATAATCACTGCTAATTTATCCA 

— , 1 , 1 , H 1 1 ■ I 1 1 1 h 1 22724 

ACCATAAACACGACTACCAAGAAATTAAGGACAAGCAGGCGCATTAAGATCACTATTACCATAAAGTCGGTATTAGTGACGATTAAATAGGT 

G1CA0GSLIPVRPRNSSDNGIS A1 ITANLS 
Spike 



TTCCCTCTAACTGGACTACTTCAGTTCAAGTTGAGTACCTCCAAATTACTAGTACTCCAATAGTTGTTGATTGTGCTACTTATGTGTGTAAT 

4 1 1 1 h-« 1 1 1 1 " 1 ' H 1 ' " I 1 »- 22816 

AAGGGAGATTGACCTGATGAAGTCAAGTTCAACTCATGGAGGTTTAATGATCATGAGGTTATCAACAACTAACACGATGAATACACACATTA

IPSNWTTSVQVEYLQITSTPIVVOCATYVCN 
Spike 






GGTAACCCTCGTTGTAAGAATCTACTTAAGCAGTATACTTCTGCTTGTAAAACTATTGAAGATGCCTTACGACTTAGTGCTCATTTGGAAAC

1 1 1 1 h— i 1 1 1 1 1 1 » I ■ i 1 i 22908 

CCATTGGGAGCAACATTCTTAGATGAATTCGTCATATGAAGACGAACATTTTGATAACTTCTACGGAATGCTGAATCACGAGTAAACCTTTG 

GNPRCKNLLKQYTSACKT I EDALRLSAHLET 
. Spike : : 

TAATGATGTTAGTAGTATGCTAACTTTCGATAGCAATGCTTTTAGTTTGGCTAATGTTACTAGTTTTGGAGATTATAACCTTTCTAGTGTTT 

■ 1 ■ ' 1 ' \ 1 1 \ I 1 1 ' I >■■■ ( h 23000 

ATTACTACAATCATCATACGATTGAAAGCTATCGTTACGAAAATCAAACCGATTACAATGATCAAAACCTCTAATATTGGAAAGATCACAAA 

NDVSSMLTF O S NAFSLANVTSFGOYNLSSV 
Spike : : 



TACCTCAGAGAAACATTCATTCAAGCCGTATAGCAGGACGTAGTGCTTTGGAAGATTTGTTGTTTAGCAAAGTTGTTACATCTGGTTTGGGT 

> 1 t 1 »■■•■! 1 I i ■ > I » 1 I H — 23092 

ATGGAGTCTCTTTGTAAGTAAGTTCGGCATATCGTCCTGCATCACGAAACGTTCTAAACAACAAATCGTTTCAACAATGTAGACCAAACCCA 

LPQRN I HSSR I AGRSALEDLLFSKVVTSGLG 
— . Spike — 



ACTGTTGATGTTGACTATAAGTCTTGTACTAAAGGTCTTTCTATTGCTGACCTTGCTTGTGCTCAGTACTACAATGGCATAATGGTTTTGCC 

1 ~h 1 i 1 1 H 1 1 1 1 1 H" 1 1 h 1 23184 

TGACAACTACAACTGATATTCAGAACATGATTTCCAGAAAGATAACGACTGGAACGAACACGAGTCATGATGTTACCGTATTACCAAAACGG 

TVD VDYKSCTKGL S I.AOLACAQYYNG I MVLP 
Spike — — 



AGGTGTTGCTGATGCTGAACGTATGGCCATGTACACAGGTTCTCTTATAGGTGGCATGGTGCTCGGAGGTCTTACATCAGCAGCCGCCATAC 

h j 1 | 1 H 1 \ h 1 1 H 1 1 ' i > H 1- 23276 

TCCACAACGACTACGACTTGCATACCGGTACATGTGTCCAAGAGAATATCCACCGTACCACGAGCCTCCAGAATGTAGTCGTCGGCGGTATG 

GVAOAERMAMYTGSL I GGMVLGGLTSA-AA I 
— Spike 1 !i 



CTTTTTCTTTGGCACTGCAAGCACGACTTAACTATGTTGCTTTACAAACTGATGTGCTTCAAGAAAATCAGAAAATTTTGGCTGCATCATTT 

— , h~ 1 1 . ,.|.. . ... ^ 1 I ' " I ' 23368 

GAAAAAGAAACCGTGACGTTCGTGCTGAATTGATACAACGAAATGTTTGACTACACGAAGTTCTTTTAGTCTTTTAAAACCGACGTAGTAAA 

pF'SLALOARLNYVALOTDVLOENQK 1 LA ASF 
■. Spike : 



AATAAGGCTATTAATAATATTGTTGCTTCTTTTAGTAGCGTTAATGATGCTATTACACATACTGCAGAGGCTATACATACTGTTACTATTGC 

-H h 1 1 1 1 K~h 1 1 1 1 1 1 ' I I 1 »■ 23460 

TTATTCCGATAATTATTATAACAACGAAGAAAATCATCGCAATTACTACGATAATGTGTATGACGTCTCCGATATGTATGACAATGATAACG

NKA1NNIVASFSSVN0AITHTAEAIHTVTIA 
. Spike 



ACTTAATAAGATTCAGGATGTTGTTAATCAACAGGGTAGTGCTCTTAACCATCTCACTTCACAATTGAGACATAATTTTCAGGCCATTTCTA 

, , . 1 1 1 h 1 1 1 1 h~ H 1 1 — 23552 

TGAATTATTCTAAGTCCTACAACAATTAGTTGTCCCATCACGAGAATTGGTAGAGTGAAGTGTTAACTCTGTATTAAAAGTCCGGTAAAGAT 

LNK I Q OVV'N O QGS ALNHL T SQL R HN F Q A I S 
— Spike 



ATTCAATTCATGCTATTTATGACCGGCTTGATTCAATTCAAGCCGATCAACAAGTTGACAGATTAATTACTGGACGGCTTGCAGCTTTGAAT 

— , 1 , 1 • \ 1 1 , 1 , j H 1 ■ ■ ' 1 » ■ ■ ' I 23644 

TAAGTTAAGTACGATAAATACTGGCCGAACTAAGTTAAGTTCGGCTAGTTGTTCAACTGTCTAATTAATGACCTGCCGAACGTCGAAACTTA 

NSIHAIYORLOSIQAOOOVORL 1TGRLAALN 
Spike — 






GCATTTGTTTCCCAAGTTTTGAATAAATATACTGAAGTTCGTGGTTCCAGACGCTTAGCACAGCAGAAGATTAATGAATGTGTCAAGTCACA 

< 1 ■ 1 1 1 1 1 — i 1 . K— 1 ■ 1 ■ ■ i 1 1 1 ~ 23736 

CGTAAACAAAGGGTTCAAAACTTATTTATATGACTTCAAGCACCAAGGTCTGCGAATCGTGTCGTCTTCTAATTACTTACACAGTTCAGTGT 



AFVSQVLNKYTEVRGSRRLAQQKINECVKSQ
Spike

ATCTAATAGATATGGTTTTTGTGGCAATGGCACTCACATCTTTTCAATCGTCAACTCAGCTCCAGATGGTTTGCTTTTTCTTCATACTGTTT 

1 ■ 1 . 1 , 1 1 j , 1 1 1 ... i H- • 1 ■ i 23828 

TAGATTATCTATACCAAAAACACCGTTACCGTGAGTGTAGAAAAGTTAGCAGTTGAGTCGAGGTCTACCAAACGAAAAAGAAGTATGACAAA 



5NRYGFCGNGTHI FSIVNSAPDGLlFLHTV 
Spike • 



TGCTGCCAACTGATTACAAGAATGTAAAGGCGTGGTCTGGTATCTGTGTTGATGGCATTTATGGCTATGTTCTGCGTCAACCTAACTTGGTT 

H I I 1 1 « 1 H 1 -I I i I ' " I 23920 

ACGACGGTTGACTAATGTTCTTACATTTCCGCACCAGACCATAGACACAACTACCGTAAATACCGATACAAGACGCAGTTGGATTGAACCAA 



L LPTDYKNVKAWSG! CVOG I YGYVLRQPNLV 
= Spike : 



CTTTATTCTGATAATGGTGTCTTTCGTGTAACTTCCAGGGTCATGTTTCAACCTCGTTTACCTGTTTTGTCTGATTTTGTGCAAATATATAA

, H . 1 1 !■■■■>■ H I 1 1 h 1 1 ~H — 24012 

GAAATAAGACTATTACCACAGAAAGCACATTGAAGGTCCCAGTACAAAGTTGGAGCAAATGGACAAAACAGACTAAAACACGTTTATATATT



LY SONGVFRVTSRVMFOPRLPVLSOFVO I YN 
Spike 



TTGTAATGTTACTTTTGTTAACATATCTCGTGTCGAGTTACATACTGTCATACCTGACTACGTTGATGTTAATAAAACATTACAAGAGTTTG 

24104

AACATTACAATGAAAACAATTGTATAGAGCACAGCTCAATGTATGACAGTATGGACTGATGCAACTACAATTATTTTGTAATGTTCTCAAAC 



C NVTFVNI SRVELHTVIPOYVOVNKTLQEF 
. Spike • • 

CACAAAACTTACCAAAGTATGTTAAGGCTAATTTTGACTTGACTCCTTTTAATTTAACATATCTTAATTTGAGTTCTGAGTTGAAGCAACTC 

> ■ I 1 1— h 1 i 1 1 ■ ■ ■ | 1 ■ ■ I 1 1 1 24196 

GTGTTTTGAATGGTTTCATACAATTCGGATTAAAACTGAACTGAGGAAAATTAAATTGTATAGAATTAAACTCAAGACTCAACTTCGTTGAG 

AONL PK YV KP NFO LTPFNLT YLNLSSEL K QL 
Spike 



GAAGCTAAAACTGCTAGTCTTTTCCAAACTACTGTTGAATTACAAGGTCTTATTGATCAGATTAACAGTACATATGTTGATTTGAAGTTGCT 

24288

CTTCGATTTTGACGATCAGAAAAGGTTTGATGACAACTTAATGTTCCAGAATAACTAGTCTAATTGTCATGTATACAACTAAACTTCAACGA 



EAKTAS LFQTTVELQGL I OOINSTYVDLKLL 
Spike 

TAATAGGTTTGAAAATTATATCAAATGGCCTTGGTGGGTTTGGCTCATTATTTCTGTTGTTTTTGTTGTATTGTTGAGTCTTCTTGTGTTTT 

_H , 1 , j 1 1 , h 1 1 1 1 1 1 h 1 I 24380 

ATTATCCAAACTTTTAATATAGTTTACCGGAACCACCCAAACCGAGTAATAAAGACAACAAAAACAACATAACAACTCAGAAGAACACAAAA 

NRFENYIKW PWWVWL1 ISVVFVVLLSLLVF 
Spike 

GTTGTCTTTCTACAGGTTGTTGTGGTTGTTGCAATTGTTTAACTTCATCAATGCGAGGCTGTTGTGATTGTGGTTCAACTAAACTTCCTTAT 

24472

CAACAGAAAGATGTCCAACAACACCAACAACGTTAACAAATTGAAGTAGTTACGCTCCGACAACACTAACACCAAGTTGATTTGAAGGAATA 



CCLSTGCCGCCNCLTSSMRGCCDCGSTKLPY
Spike



TATGAATTTGAAAAGGTCCACGTTCAATAATGCCTTTCGGTGGCCTATTTCAACTTACTCTTGAAAGTACTATTAATAAGAGTGTGGCTAAT

24564

ATACTTAAACTTTTCCAGGTGCAAGTTATTACGGAAAGCCACCGGATAAAGTTGAATGAGAACTTTCATGATAATTATTCTCACACCGATTA 

YEFEKVHVQ. 



- Spike * 



MPFGGIFQITLEST I NKSVAN 



-ORF 4ab- 



CTCAAATTACCACCTCATGATGTTACTGTCTTGCGTGACAATCTTAAACCTGTTACTACACTTAGTACTATCACTGCTTATTTGTTAGTTAG 

24656

GAGTTTAATGGTGGAGTACTACAATGACAGAACGCACTGTTAGAATTTGGACAATGATGTGAATCATGATAGTGACGAATAAACAATCAATC 

LKLPPHDVTVLRDNLKPVTTLST | TAYLLVS 
- ORF 4ab— ■ 



TTTGTTTGTCACTTATTTTGCTTTATTCAAACCTCTTACTGCTAGAGGTCGCGTTGCTTGTTTTGTTTTAAAACTATTGACACTATCTGTCT 

24748

AAACAAACAGTGAATAAAACGAAATAAGTTTGGAGAATGACGATCTCCAGCGCAACGAACAAAACAAAATTTTGATAACTGTGATAGACAGA 

LFVTYFALFKPLTARGRVAC F V LKLLTLSV 
: ORF 4ab — — 



ATGTGCCTTTATTGGTTCTTTTTGGTATGTATCTTGACAGTTTTATAATTTTTTTTCTACGCTGTTGTTTCGATTCATACATGTTGGCTATT 

24840

TACACGGAAATAACCAAGAAAAACCATACATAGAACTGTCAAAATATTAAAAAAAAGATGCGACAACAAAGCTAAGTATGTACAACCGATAA 

YVPLLVLFGMYLDSF ! IFFL RCCFDSYMLAI 
ORF 4ab 



ATGCCTATCTCTAATAAAAATTTTTCATTTGTTTTGTTCAATGTTACTAAACTATGCTTCGTTTCAGGCAAGTGTTGGTATCTTGAACAATC 

h 1 i h 1 H * 1 1 1 ~h H I 1 1 — 24932 

TACGGATAGAGATTATTTTTAAAAAGTAAACAAAACAAGTTACAATGATTTGATACGAAGCAAAGTCCGTTCACAACCATAGAACTTGTTAG 

MP I SNKNFSFVLFNVTKtCFVSGKCWYLEOS 
-ORF 4ab 



ATTTTATGAAAATCGTTTTGCTGCTATTTATGGTGGTGACCACTATGTCGTTTTAGGTGGTGAAACTATTACTTTTGTTTCTTTTGATGACC 

— , 1 , j , \ 1 1 1 . 1 1 H > ■ ■ ■ I ■ — I 25024 

TAAAATACTTTTAGCAAAACGACGATAAATACCACCACTGGTGATACAGCAAAATCCACCACTTTGATAATGAAAACAAAGAAAACTACTGG 

FYENRFAA I YGGOHYVVLGGET I TFVSFOO 
ORF 4ab ■ 



TTTATGTTGCTATTAGAGGTTCTTGTGAAAAGAACCTACAACTTATGCGTAAGGTTGACTTGTATAATGGTGCTGTCATTTACATTTTTGCC 

25116

AAATACAACGATAATCTCCAAGAACACTTTTCTTGGATGTTGAATACGCATTCCAACTGAACATATTACCACGACAGTAAATGTAAAAACGG 

LYVA 1 RGSCEKNIQLHRKVDLYNGAV I Y I FA 
■ ORF 4ab — 



GAAGAGCCTGTTGTTGGTATAGTTTACTCCTCTCAACTATACGAAGATGTTCCTTCGATTAATTGATGACAATGGCATTGTCCTCAATTCTA 

1 *h j . j 1 ! 1 1 i 1 1 1 1 I i 1 ' ' 25208 

CTTCTCGGACAACAACCATATCAAATGAGGAGAGTTGATATGCTTCTACAAGGAAGCTAATTAACTACTGTTACCGTAACAGGAGTTAAGAT 

.ttFLRL I ODNG I VLNS 
I E 



EEPVVGIVYSSQLYEDVPSIN.
ORF 4ab



TTTTATGGCTCCTTGTTATGATATTTTTCTTTGTGTTGGCAATGACCTTTATTAAACTGATTCAATTGTGTTTTACTTGTCATTATTTTTTT

H 1 1 « 1 1 \ 1 1 1 1 * ' 1 1 ! > ■ 1 H 1 1 1 25300 

AAAATACCGAGGAACAATACTATAAAAAGAAACACAACCGTTACTGGAAATAATTTGACTAAGTTAACACAAAATGAACAGTAATAAAAAAA 

ILWLLVM I FFFVIAMTF IKL 1QLCFTCHYFF 

. : _E 



AGTAGGACATTATATCAACCAGTTTATAAAATTTTTCTTGCTTACCAAGATTATATGCAAATAGCACCTGTTCCAGCTGAAGTACTAAATGT

_4 1 1 1 1 1 1 | 1 1 1 m i — H h— H 1 1 — 25392 

TCATCCTGTAATATAGTTGGTCAAATATTTTAAAAAGAACGAATGGTTCTAATATACGTTTATCGTGGACAAGGTCGACTTCATGATTTACA 

SRTLYOPVYK IFLAYODYMQ 1APVPAEVLNV 
E 



CTAAACTAAACGATGTCTAATAGTAGTGTGCCTCTTTCAGAGGTTTATGTCCATTTACGTAACTGGAACTTTAGTTGGAATTTAATTCTAAC 

H m 1 1 1 1 1 1 |—i 1 • 1 25484 

GATTTGATTTGCTACAGATTATCATCACACGGAGAAAGTCTCCAAATACAGGTAAATGCATTGACCTTGAAATCAACCTTAAATTAAGATTG 

.HSNSSVPL SE V.YVHLRNWNFSWNL ILT 
-E J 1 ; M • ; ' 

AGTTTTTATAGTTGTGTTGCAGTATGGGCATTATAAGTATAGCAGACTTCTTTATGGTTTAAAGATGTCTGTTTTATGGTGTTTATGGCCAC 

H 1 i 1 1 1 1 — < h I 1 1 1 ' \ > 1 l 1 1 1 i r ' 25576 

TCAAAAATATCAACACAACGTCATACCCGTAATATTCATATCGTCTGAAGAAATACCAAATTTCTACAGACAAAATACCACAAATACCGGTG 

VF IVVLOYGH YK YSR LLYGLKMSV LWCLWP 
M 



TTGTTCTAGCTTTGTCTATTTTTGACTGTTTTGTCAATTTTAATGTGGACTGGGTCTTTTTTGGTTTTAGTATTCTTATGTCTATTATTACA 

1 1 1 1 1 1 1 'I 1 t-J 1 'i I ■ ■ ■ I 1 25668 

AACAAGATCGAAACAGATAAAAACTGACAAAACAGTTAAAATTACACCTGACCCAGAAAAAACCAAAATCATAAGAATACAGATAATAATGT 

LVLAL5I FDCFVNFNVOWVFFG FSILMS I IT 
M 



CTTTGTTTATGGGTTATGTATTTTGTTAATAGTTTCAGACTTTGGCGCCGTGTTAAAACTTTTTGGGCTTTTAATCCTGAAACTAATGCAAT 

' I I ' 1 ' 1 1 I ' 'I 1 I 1 ' I 25760 

GAAACAAATACCCAATACATAAAACAATTATCAAAGTCTGAAACCGCGGCACAATTTTGAAAAACCCGAAAATTAGGACTTTGATTACGTTA 

LCLWVMYFVNSFRLWRRVKTFWAF NPETNA I 
M 



CATCTCTCTCCAGGTTTATGGACATAATTATTACTTACCGGTGATGGCTGCACCTACAGGTGTTACATTAACACTTCTTAGTGGTGTACTTC

— h 1 ' 1 1 1 1 H 1 1 1 1 1 : 1 I — 25852 

GTAGAGAGAGGTCCAAATACCTGTATTAATAATGAATGGCCACTACCGACGTGGATGTCCACAATGTAATTGTGAAGAATCACCACATGAAG

1 SLQVYGHNYYLPVHAAPTGVTLTLLSGVL 
M 



TTGTTGATGGCCATAAGATTGCTACTCGTGTTCAAGTGGGTCAGTTGCCTAAATATGTAATAGTTGCTACACCTAGTACCACAATTGTTTGT 

— •— ' h 1 1 1 \ 1 1 1 1 ■ 1 -h >-H 1 1 25944 

AACAACTACCGGTATTCTAACGATGAGCACAAGTTCACCCAGTCAACGGATTTATACATTATCAACGATGTGGATCATGGTGTTAACAAACA 

LVOGH K ! ATR VOVGOLPKYV IV ATPSTT I VC 
M 



GACCGTGTTGGTCGCTCTGTTAATGAAACAAGCCAGACTGGTTGGGCATTCTACGTCCGTGCTAAACATGGTGATTTTTCTGGTGTTGCCTC

^ 1 1 1 1 m K— i 1 1 » ' l i ■ I ■ ■ ■ i 1 26036 

CTGGCACAACCAGCGAGACAATTACTTTGTTCGGTCTGACCAACCCGTAAGATGCAGGCACGATTTGTACCACTAAAAAGACCACAACGGAG 

ORVGRSVNETSOTGWAFYVRAKHGDFSGVAS 
M



TCAGGAGGGTGTTTTGTCAGAAAGAGAGAAGTTGCTTCATTTAATC

AGTCCTCCCACAAAACAGTCTTTCTCT CTTCAACGAAGTAAATTAGATTTGATTTGTTTTACCGATCACATTT^^ 26128 



Q E G V I S E R E K L L H L 1 . , M . A S V N W A 0 0 R A 
" I N 



GCTAGGAAGAAATTTCCTCCTCCTTCATTTTACATGCCTCTTTTGGTTAGTTCTGATAAGGCACCATATAGGGTCATTCCCAGGAATCTTGT

26220

CGATCCTTCTTTAAAGGAGGAGGAAGTAAAATGTACGGAGAAAACCAATCAAGACTATTCCGTGGTATATCCCAGTAAGGGTCCTTAGAACA

ARKKFPPPSFYMPLLVSSDKAPYRVIPRNLV

CCCTATTGGTAAGGGTAATAAAGATGAGCAGATTGGTTATTGGAATGTTCAAGAGCGTTGGCGTATGCGCAGGGGGCAACGTGTTGATTTGC

' 1 ' ' ' 1 ' ' 1 1 1 ' 1 i ■ | i | ! , j . 26312 

GGGATAACCATTCCCATTATTTCTACTCGTCTAACCAATAACCTTACAAGTTCTCGCAACCGCATACGCGTCCCCCGTTGCACAACTAAACG 

P .I GKGNKDEQ IGY WNVQERWRMRRGORVOL 
■ N - — — 



CTCCTAAAGTTCATTTTTATTACCTAGGTACTGGACCTCATAAGGACCTTAAATTCAGACAACGTTCTGATGGTGTTGTTTGGGTTGCTAAG

' • I i - i i | i | i ■ i , | j i i i ■ i i | i | 26404 

GAGGATTTCAAGTAAAAATAATGGATCCATGACCTGGAGTATTCCTGGAATTTAAGTCTGTTGCAAGACTACCACAACAAACCCAACGATTC 
PP. KVHFYYLGTGPHK D LKFRQRSDGVVWVAK 



GAAGGTGCTAAAACTGTTAATACCAGTCTTGGTAATCGCAAACGTAATCAGAAACCTTTGGAACCAAAGTTCTCTATTGCTTTGCCTCCAGA

1 1 * 1 ' ' 1 ' 1 i ... | i | . « , | , i , j | 26496 

CTTCCACGATTTTGACAATTATGGTCAGAACCATTAGCGTTTGCATTAGTCTTTGGAAACCTTGGTTTCAAGAGATAACGAAACGGAGGTCT 
EGAK TVNTSLGNRKr 'n OKPLE.PKFSIALPPE 

GCTCTCTGTTGTTGAGTTTGAGGATCGCTCTAATAACTCATCTCGTGCTAGCAGTCGTTCTTCAACTCGTAACAACTCACGAGACTCTTCTC

26588

CGAGAGACAACAACTCAAACTCCTAGCGAGATTATTGAGTAGAGCACGATCGTCAGCAAGAAGTTGAGCATTGTTGAGTGCTCTGAGAAGAG 
ISVVEFEDRSNNSS R ^ A SSRSSTRNNSRDSS 



GTAGTACTTCAAGACAACAGTCTCGCACTCGTTCTGATTCTAACCAGTCTTCTTCAGATCTTGTTGCTGCTGTTACTTTGGCTTTAAAGAAC

26680

CATCATGAAGTTCTGTTGTCAGAGCGTGAGCAAGACTAAGATTGGTCAGAAGAAGTCTAGAACAACGACGACAATGAAACCGAAATTTCTTG

RSTSRQQSRTRSDSNQSSSDLVAAVTLALKN



TTAGGTTTTGATAACCAGTCGAAGTCACCTAGTTCTTCTGGTACTTCCACTCCTAAGAAACCTAATAAGCCTCTTTCTCAACCCAGGGCTGA

11 ' ' 1 ' 1 11 1 ,1 11 I ' I h I i i i i i ■ | | 26772 

AATCCAAAACTATTGGTCAGCTTCAGTGGATCAAGAAGACCATGAAGGTGAGGATTCTTTGGATTATTCGGAGAAAGAGTTGGGTCCCGACT 

L6F0NQSKSPSS.SGT STPKKPNKPLSOPR. AO 

■ N-~ — I,. 

TAAGCCTTCTCAGTTGAAGAAACCTCGTTGGAAGCGTGTTCCTACCAGAGAGGAAAATGTTATTCAGTGCTTTGGTCCTCGTGATTTTAATC
1 1 ' 1 1 ' 1 1 1 I ■ 1 | t | i | 26864 

ATTCGGAAGAGTCAACTTCTTTGGAGCAACCTTCGCACAAGGATGGTCTCTCCTTTTACAATAAGTCACGAAACCAGGAGCACTAAAATTAG

K PSQLKKPRWKRVPT KEENVIOCFGPRDFN 

N



ACAATATGGGGGATTCAGATCTTGTTCAGAATGGTGTTGATGCCAAGGGTTTTCCACAGCTTGCTGAATTGATTCCTAATCAGGCTGCGTTA

? ■ 1 " I 1 1 » " H ' 1 ' H- — ' 1 ■ 1 1 1 I • 1 ' 26956 

TGTTATACCCCCTAAGTCTAGAACAAGTCTTACCACAACTACGGTTCCCAAAAGGTGTCGAACGACTTAACTAAGGATTAGTCCGACGCAAT 

HNMG0S0LVQNGVDAKGFPQIAEL1PN0AAL 
N 



TTCTTTGATAGTGAGGTTAGCACTGATGAAGTGGGTGATAATGTTCAGATTACCTACACCTACAAAATGCTTGTAGCTAAGGATAATAAGAA 

H i I ? 1 1 H h 1 1 1 1 1 27048 

AAGAAACTATCACTCCAATCGTGACTACTTCACCCACTATTACAAGTCTAATGGATGTGGATGTTTTACGAACATCGATTCCTATTATTCTT

FFDSEVSTDEVGDNVQ I TYTYKMLVAKDNKN 
— N 



CCTTCCTAAGTTCATTGAGCAGATTAGTGCTTTTACTAAACCCAGTTCTATCAAAGAAATGCAGTCACAATCATCTCATGTTGCTCAGAACA 

H • I 1 1 I h 1 " ■ ■ i 1 1 1 1 1 ' 1 " 1 1 »- 27140 

GGAAGGATTCAAGTAACTCGTCTAATCACGAAAATGATTTGGGTCAAGATAGTTTCTTTACGTCAGTGTTAGTAGAGTACAACGAGTCTTGT

LPKF lEQ'l.SAFTXPS'siXENOSQSSHVAON 
. N 



CAGTACTTAATGCTTCTATTCCAGAATCTAAACCATTGGCTGATGATGATTCAGCCATTATAGAAATTGTCAACGAGGTTTTGCATTAAATT 

— i 1 t ■ < 1 1 1 1 1 h 1 I 1 » 1 1 — 27232 

GTCATGAATTACGAAGATAAGGTCTTAGATTTGGTAACCGACTACTACTAAGTCGGTAATATCTTTAACAGTTGCTCCAAAACGTAATTTAA 

TVLNASIPESK.PLAOOOSAI IEIVNEVLH. h 1 " 
_ N : ' 

GTTTTGTAATTCCAGTTGAATGTTTATTATTATTAGTTGCAACCCCATGCGTTTAGCGCATGATAAGGGTTTAGTCTTACACACAATGGTAG 

— , 1 | 1 1 1 — 1 , 1 , 1 ^-h 1 h H-r— 27324 

CAAAACATTAAGGTCAACTTACAAATAATAATAATCAACGTTGGGGTACGCAAATCGCGTACTATTCCCAAATCAGAATGTGTGTTACCATC 



3'UTR



GCCAGTGATAGTAAAGTGTAAGTAATTTGCTATCATATTAACATGTCTAGAGGAAAGTCAGAACTTTTTCTGTTTGTGTTGTTGGAGTACTT 

> ■ ■ ■■» 1 1 1 1 1 1 1 . h 1 1 1 I 1 1 1- 27416 

CGGTCACTATCATTTCACATTCATTAAACGATAGTATAATTGTACAGATCTCCTTTCAGTCTTGAAAAAGACAAACACAACAACCTCATGAA 



■3'UTR- 



AAAGATCGCATAGGCGCGCCAACAATGGAAGAGCCAACAACATATCTAAAAATGTTTTGTCTGGTACTTGTTAATGATATTGTTTTTGATAT 

1 1 h~ I I ' "I 1 1 H h h h , 27508 

TTTCTAGCGTATCCGCGCGGTTGTTACCTTCTCGGTTGTTGTATAGATTTTTACAAAACAGACCATGAACAATTACTATAACAAAAACTATA 



■3'UTR- 
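
Each block of Fig. 1 prints the genomic strand with its base-by-base complement (not reversed) directly underneath, so the lower strand can be used to spot character-recognition damage in the upper one and vice versa. The following is a minimal sketch of such a check, not part of the original figure; the names `complement` and `mismatches` are hypothetical, and the second test string is an illustrative damaged copy.

```python
# Hypothetical helpers for cross-checking the two printed strands of a block.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def complement(strand: str) -> str:
    """Return the base-by-base complement (not reversed) of a DNA strand."""
    return strand.upper().translate(COMPLEMENT)


def mismatches(top: str, bottom: str) -> list[int]:
    """Return 0-based columns where `bottom` is not the complement of `top`."""
    if len(top) != len(bottom):
        raise ValueError("strands must have the same length")
    expected = complement(top)
    return [i for i, (e, b) in enumerate(zip(expected, bottom.upper())) if e != b]


# First 20 columns of the block that ends at position 27508.
top = "AAAGATCGCATAGGCGCGCC"
bottom = "TTTCTAGCGTATCCGCGCGG"
print(mismatches(top, bottom))  # []

# Illustrative damaged copy: a digit printed where a base should be.
print(mismatches("CCAAATTACTAGTACTCC", "GGTTTAATGA7CATGAGG"))  # [10]
```

A reported column only says that the two strands disagree there; which strand carries the error has to be decided from the translation line or from the original sheet.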



GGATACACAAAAAAAAAAAAAAAA 

■H h 1 1 27532 

CCTATGTGTTTTTTTTTTTTTTTT 



3'UTR

Fig. 4 Alignments

a. 5' untranslated region (Genomic sequence) aligned with human
coronavirus 229E

           ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
               5          15         25         35         45         55
EMCR5'UTR                           AGATAGA GAATTTTCTT ATTTAGACTT TGTGTCTACT
229E5'UTR  ACTTAAGTAC CTTATCTATC TACAGATAGA AAAGTTGCTT -TTTAGACTT TGTGTCTACT

           ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
               65         75         85         95         105        115
EMCR5'UTR  CCTCTCAACT AAACGAAATT TTT-CTAGTG CTGTCATTTG TTATG--GCA GTCCTAGTGT
229E5'UTR  TTTCTCAACT AAACGAAATT TTTGCTATGG CCGGCATCTT TGATGCTGGA GTCGTAGTGT

           ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
               125        135        145        155        165        175
EMCR5'UTR  AATTGAAATT TCGTCAAGTT TGTAA-ACTG GTTAGGCAAG TGTTGTATTT TCTGTGTTTA
229E5'UTR  AATTGAAATT TCATTTGGGT TGCAACAGTT TGGAAGCAAG TGCTGTGTGT CCTA-GTCTA

           ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
               185        195        205        215        225        235
EMCR5'UTR  AGCACTGGTG GTTCTGTC-C ACTAGTGCAC AC-ATTGATA CTTAAGT-GG TGTTCTGTCA
229E5'UTR  AGGGTTTCGT GTTCCGTCAC GAGATTCCAT TCTACAAACG CCTTACTCGA GGTTCCGTCT

           ....|....| ....|....| ....|....| ....|....| ....|....| ....
               245        255        265        275        285
EMCR5'UTR  CTGCTTATTG TGGAAGCAAC GTTCTGTCGT TGTGGAAACC AATAACTGCT AACC
229E5'UTR  CGTGTTTGTG TGGAAGCAAA GTTCTGTCTT TGTGGAAACC AGTAACTGTT CCTA
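
The agreement between two rows of an alignment block such as the one above can be summarized as percent identity over the aligned columns. The sketch below is an illustration only and is not taken from the source; `percent_identity` is a hypothetical name, and the convention of skipping every column in which either row has a gap is an assumption (other conventions count gaps as mismatches).

```python
def percent_identity(row_a: str, row_b: str, gap: str = "-") -> float:
    """Percent of gap-free aligned columns in which both rows carry the same symbol.

    Blanks that only separate the printed groups of ten are ignored.
    Assumption: a column is skipped when either row has a gap there.
    """
    a = row_a.replace(" ", "").upper()
    b = row_b.replace(" ", "").upper()
    if len(a) != len(b):
        raise ValueError("aligned rows must have the same number of columns")
    compared = 0
    matches = 0
    for x, y in zip(a, b):
        if x == gap or y == gap:
            continue  # skip gapped columns (assumed convention)
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Final block of alignment a (columns 241 onward), copied from the rows above.
emcr = "CTGCTTATTG TGGAAGCAAC GTTCTGTCGT TGTGGAAACC AATAACTGCT AACC"
hcov_229e = "CGTGTTTGTG TGGAAGCAAA GTTCTGTCTT TGTGGAAACC AGTAACTGTT CCTA"
print(f"{percent_identity(emcr, hcov_229e):.1f}% identical over this block")
```

Rows whose leading or internal gaps were lost in reproduction have unequal lengths and are rejected rather than silently misaligned.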



b. Putative Orf 1a

. ... i .... i ...i ..i . ... i .... i — i. ...i 

5 15 25 35 45 55 

EMCR MFYNQVT LAVASDSEIS GFGFAIPSVA VRAYSEAAAQ GFQACRFVAF 

229E MACNRVT LAVASDSEIS ANGCSTIAQA VRRYSEAASN GFRACRFVSL 

PEDV MASNHVT LAFANDAEIS AFGFCTASEA VSYYSEAAAS GFMQCRFVSL 

TGEV MSSKQFK ILVNEDYQVN VPSLPIR-DV LQEIKYCYRN GFEGYVFVPE 

OC43 MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDMIC STTAQKLETD GICPENHVMV 

BoCoV MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDIVC STTAQKLETG GICPENHVMV 

MHV MAKMGKYGLG FKWAPEFPWM LPNASEKLGS PERSEEDGFC PSAAQEPKTK GKTLINHVRV 

AIPV MASSLKQ GVSPKPRDVI LVSKDIPEQL CDALFFYTSH NPKDYADAFA 

SARS CoV MESLVLG VNEKTHVQLS LPVLQVRDVL VRGFGDSVEE ALSEAREHLK 

....|.. ..I . ... 1 .... I ....I.. ..| ....|.... | ( .... I ....I. ...I 

65 75 85 95 105 115 

EMCR GLQDCVTGIN DDD-YVIALT GTNQLCAKIL LFSDRPLNLR GWLIFSNSNY VLQDFDWFG 

229E DLQDCIVGIA DDT-YVMGLH GNQTLFCNIM KFSDRPFMLH GWLVFSNSNY LLEEFDWFG 

PEDV DLADTVEGLL PED-YVMVVI GTTKLSAYVD TFGSRPRNIC GWLLFSNCNY FLEELELTFG 

TGEV YCRDLVDCDR KDH-YVIGVL GNGVSDLKPV LLTEPSVMLQ GFIVRANCNG VLEDFDLKIA 

OC43 DCRRLLKQEC CVQSSLIREI VMNASPYDLE VLLQDALQSR EAVLVTTPLG MSLEACYVRG 

BoCoV DCRRLLKQEC CVQSSLIREI VMNTRPYDLE VLLQDALQSR EAVLVTPPLG MSLEACYVRG

MHV DCSRLPALEC CVQSAIIRDI FVDEDPLNVE ASTMMALQFG SAVLVKPSKR LSIQAWAKLG 

AIPV VRQKFDRSLQ TGKQFKFETV CGLFLLKGVD KITPG VPAKVLKATS KLADLEDIFG 

SARS CoV NGTCGLVELE KGVLPQLEQP YVFIKRSDAL STNHGHKWE LVAEMDGIQY GRSGITLGVL 

....I....! ....|....| I | ....|. ...| ..| ....|. ...I 

125 135 145 155 165 175

EMCR — HGAGSVVF VDKYMCGFDG KPVLPKNMWE FRDYFNDNTD S-IVIGGVTY QLAWDVIRKD 

229E K-RGGGNVTY TDQYLCGADG KPVMSEDLWQ FVDHFGENEE — IIINGHTY VCAWLTKRKP 

PEDV --RRGGNIVP VDQYMCGADG KPVLQESEWE YTDFFADSED GQLNIAGITY VKAWIVERSD 

TGEV — RTGRGAIY VDQYMCGADG KPVIEG D FKDYFGDED- -IIEFEGEEY HCAWTTVRDE 

OC43 C-NPKGWTMG LFRRRSVCNT GRCTVNKHVA YQLYMIDPAG VCLGAGQ FVGWVIPLAF 

BoCoV C-NPNGWTMG LFRRRSVCNT GRCAVNKHVA YQLYMIDPAG VCFGAGQ FVGWVIPLAF 

MHV V-LPKTPAMG LFKRFCLCNT RECVCDAHVA FQLFTVQPDG VCLGNGR FIGWFVPVTA 

AIPV VSPLARKYRE LLKTACQWSL TVEALDVRAQ TLDEIFDPT EILWLQVAAK 

SARS CoV VPHVGETPIA YRNVLLRKNG NKGAGGHSYG IDLKSYDLGD ELGTDPIEDY EQNWNTKHGS

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

185 195 205 215 225 235 

EMCR LSYEQQNVLA IESIHYLG-T TGHTLKSGCK LINAKPPKY SSKWLSG EWNAVYKAFG 

229E LDYKRQNNLA IEEIEYVHGD ALHTLRNGSV LEMAXEVKT SSKWLSD ALDKLYKVFG

PEDV VSYASQNLTS IKSITYCS-T YEHTFLOGTA MKVARTPKI KKNVVLSE PLATIYREIG 

TGEV KPLNQQTLFT IQEIQYNL-D I PHKLPNCAT RHVAPPVKK NSKIVLSE DYKKLYDIFG 

OC43 MPVQSRKFIV PWVMYLRKRG EKGAYNKDHG RGGFGH VYDFKVED AYDQVHDEPK 

BoCoV MPVQSRKFIA PWVMYLRKCG EKGAYIKDYK RGGFEH VYNFKVED AYDLVHDEPK 

MHV IPAYAKQWLQ PWSILLRKGG NKGSVTSGHF RRAVTMP VYDFNVED ACEEVHLNPK

SARS CoV MDGCTSSTCM MCYKRNRATR VECTTIVNGM KRSFYVYANG GRGFCKTHNW NCLNCDTFCT

2705 2715 2725 2735 2745 2755

EMCR GNTFINGDIA RELGNWKTA VQPTAPAYVI IDKVDFVNGF YRLYSGDTFW RYDFDITESK 

229E GSTFITPEVS RELGNITKTN VQPTGPAYVM IDKVEFENGF YRLYSCETFW RYNFDITESK

PEDV GCTFINDVIA TEVGNWKLN VQPTGPATIL IDKVEFSNGF YYLYSGDTFW KYNFDITDSK 

TGEV ENTFICDEIV RDLSNSVKQT VYATDRSHQE VTKVECSDGF YRFYVGOEFT SYDYDVKHKK 

OC43 GNTFITVEAA LDLSKELKRP IQPTDVAYHT VTDVKQVGCS MRLFYDRDGQ RTYDDVNASL 

BoCoV GNTFITVEAA LDLSKELKRP IQPTDVAYHT VTDVKQVGCY MRLFYDRDGQ RTYDDVNASL 

MHV GNTFITHEAA ADLSKELKRP VNPTDSAYYL VTEVKQVGCS MRLFYERDGQ RVYDDVSASL 

AIPV QNTFMSPEVA GELSEKLKRH VKPTAYAYHV VDEACLVDDF VNLKYKAATP GKDSASSAVK 

SARS CoV GSTFTSDEVA RDLSLQFKRP INPTDQSSYI VDSVAVKNGA LHLYFDKAGQ KTYERHPLSH 



....|....| ....|. ...| ....|....| . ... I .... I ....I. ...I • . . . 1 I 

2765 2775 2785 2795 2805 2815 

EMCR YSCKEVLKN CNVLENFIVY NNSGS — NIT QIKNACVYFS QLLCEPIKLV 

229E YSCKEVFKN CNVLDDFIVF NNNGT — NVT QVKNASVYFS QLLCRPIKLV 

PEDV YTCKEALKN CSIITDFIVF NNNGS--NVN QVKNACVYFS QMLCKPVKLV 

TGEV YSSQEVLKS MLLLDDFIVY SPSGS— ALA NVRNACVYFS QLIGKPIKIV 

OC43 FVDYSNLLHS KV KSVPNMHVW VENDA--DKA NFLNAAVFYA QSLFRPILMV 

BoCoV FVDYSNLLHS KV KSVPNMHVW VEND A — DKA NFLNAAVFYA QSLFRPILMV 

MHV FVDMNGLLHS KV KGVPETHVW VENEA — DKA GFLNAAVFYA QSLYRPMLLV 

AIPV CFSVTDFLKK AVFLKEALKC EQISNDGFIV CNTQSAHALE EAKNAAIYYA QYLCKPILIL 
SARS CoV FVNLDNLRAN NT KGSLPINVIV FDGKSKCDES ASKSASVYYS QLMCQPILLL



I.. ..I ....I. ...I ...,|....| ...,|....| ,...|....| 

2825 2835 2845 2855 2865 2875 

EMCR NSELLSTLS VDFNGVLHK AYVDVLCNSF FKELTANMSM AECKATLGLT 

229E DSELLSTLS- -VDFNGVLHK AYIDVLRNSF GKDLNANMSL AECKRALGLS 

PEDV DSALLASLS- -VDFGASLHS AFVSVLSNSF GKDLSSCNDM QDCKSTLGFD D 

TGEV NSDLLEDLS VDFKGALFN AKKNVIKNSF NVDVSECKNL DECYRACNLN 

OC43 DKNLITTANT GTSVTETMFD VYVDTFLSMF DVDKKSLNAL IATAHSSIKQ GTQIYKVLDT 

BoCoV DKILITTANT GTSVTETMFD VYVDTFLSMF DVDKKSLNAL IATAHSSIKQ GTQICKVLDT 

MHV EKKLITTANT GLSVSQTMFD LYVDSLLGVL DVDRKSLTSF VNAAHNSLKE GVQLEQVMDT 

AIPV DQALYEQLW -EPVSKSVID KVCSILSSII SVDTAALNYK AGTLRDALLS 

SARS CoV DQVLVSDVGD STEVSVKMFD AYVDTFSATF SVPMEKLKAL VATAHSELAK GVALDGVLST



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2885 2895 2905 2915 2925 2935 

EMCR VSDDDF VSAVANAHRY DVLLSDLSFN NFFISYAKPE DK-LSVYDIA 

229E ISDHEF TSAISNAHRC DVLLSDLSFN NFVSSYAKPE EK-LSAYDLA

PEDV VPLDTF NAAVAEAHRY DVLLTDMSFN NFTTSYAKPE EK-FPVHDIA 

TGEV VSFSTF EMAVNNAHRF GILITDRSFN NFWPSKVKPG SSGVSAMDIG 

OC43 FLSCARKSCS IDSDVDTKCL ADSVMSAVSA GLELTDESCN NLVPTYLKSD N--IVAADLG

BoCoV FLSCARKSCS IDSDVDTKCL ADSVMSAVSA GLELTDESCN NLVPTYLKGD N--IVAADLG

MHV FIGCARRKCA IDSDVETKSI TKSIMSAVNA GVDFTDESCN NLVPTYVKSD T--IVAADLG

AIPV ITKDEEA VDMAIFCHNH DVDYTGDGFT NVIPSYGIDT G-KLTPRDRG

SARS CoV FVSAARQG-V VDTDVDTKDV IECLKLSHHS DLEVTGDSCN NFMLTYNKVE N--MTPRDLG



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
2945 2955 2965 2975 2985 2995 

EMCR CCMRAGSKVV NHNVLIKESI PIVWGVKDFN TLSQEGKKYL VKTTKAKGLT FLLTFNDNQA 

229E CCMRAGAKW NANVLTKDQT PIVWHAKDFN SLSAEGRKYI VKTSKAKGLT FLLTINENQA

PEDV TCMRVGAKIV NHNVLVKDSI PWWLVRDFI ALSEETRKYI IRTTKVKGIT FMLTFNDCRM 

TGEV KCMTSDAKIV NAKVLTQRGK SWWLSQDFA ALSSTAQKVL VKTFVEEGVN FSLTFNAVGS 

OC43 VLIQNSAKHV QGNVAKIAGV SCIWSVDAFN QFSSDFQHKL KKACCKTGLK LKLTYNKQMA 

BoCoV VLIQNSAKHV QGNVAKIAGV SCIWSVDAFN QLSSDFQHKL KKACCKTGLK LELTYNKQMA

MHV VLIQNNAKHV QANVAKAANV ACIWSVDAFN QLSADLQHRL RKACSKTGLK IKLTYNKQEA

AIPV FLINADASIA NLRVKN — AP PVVWKFSELI KLSDSCLKYL ISATVKSGVR FFITKSGAKQ 

SARS CoV ACIDCNARHI NAQVAKSHNV SLIWNVKDYM SLSEQLRKQI RSAAKKNNIP FRLTCATTRQ 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3005 3015 3025 3035 3045 3055 

EMCR ITQVP A TSIVAKQGAG FKRTYNFLWY VCLFVVALFI GVSFID 

229E VTQIP A TSIVAKQGAG D AGHSLTWLWL LCGLVCLIQF YLCFFMPY-- 

PEDV HTTIP T VCIANKKGAG LP S FSKVKKFFWF LCLFIVAAFF ALSFLD 

TGEV DDDLPYERFT ESVSPKSGSG FFDVITQLKQ IVILVFVFIF ICGLCSVYSV 

OC43 NVSVL T TPFSLKGGAV FS Y FVYVCFVLSL VCFIGLWCLM PTYTVH

BoCoV NVSVL T TPFSLKGGAV FS Y FVYVCFVLSL VCFIGLWCLM PTYTVH

MHV NVPIL T TPFSLKGGAV FS K VLQWLFVVNL ICFIVLWALM PTYAVH 

AIPV VIACHT--QK LLVEKKAGGI VSGTFKCFKS YFKWLLIFYI LFTACCSGYY YMEVSKSFVH 

SARS CoV VVNVI T TKISLKGGKI VS T CFKLMLKATL LCVLAALVCY IVMPVHTLS- 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3065 3075 3085 3095 3105 3115 

EMCR -YTTTVTSFH GYDFKYIENG QLKVFEAPLH CVRNVFDNFN QWHEAKFGVV TTNSD-KCPI 

229E FMYDIVSSFE GYDFKYIENG QLKNFEAPLK CVRNVFENFE DWHYAKFGFT PLNKQ-SCPI 

PEDV -FSTQVSSDS DYDFKYIESG QLKTFDNPLS CVHNVFINFD QWHDAKFGFT PVNNP-SCPI 

TGEV ATQSYIESAE GYDYMVIKNG IVQPFDDTIS CVHNTYKGFG DWFKAKYGFI PTFGK-SCPI 

OC43 --KSDFQLPV YASYKVLDNG VIRDVSVEDV CFANKFEQFD QWYESTFGLS YYSNSMACPI

BoCoV --KSDFQLPV YASYKVLDNG VIRDVSVEDV CFANKFEQFD QWYESTFGLS YYSNSMACPI

MHV --KSDMQLPL YASFKVIDNG VLRDVTVTDA CFANKFIQFD QWYESTFGLV YYRNSRACPV

AIPV PMYDVNSTLH VEGFKVIDKG VLREIVPEDT CFSNKFVNFD AFWG RP YDNSR-NCPI

SARS CoV --IHDGYTNE IIGYKAIQDG VTRDIISTDD CFANKHAGFD AWFSQRGGSY KNDKS--CPV
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3125 3135 3145 3155 3165 3175

EMCR WG---VSER INVVPGVPTN VYLVG KTLV FTLQAAFGNT GVCYDFOGVT

229E WG--VSEI VNTVAGIPSN VYLVG KTLI FTLQAAFGNA GVCYDIFGVT

PEDV VVG VSDE ARTVPGIPAG VYLAG KTLV FAINTIFGTS GLCFDASGVA

TGEV WGTVFDLEN MRPIPDVPAY VSIVG--- RSLV FAINAAFGVT NMCYDHTGNA

OC43 WA-VIDQDF GSTVFNVPTK VLRYG YHVL HFITHALSAD GVQCYTPHSQ

BoCoV WA-WDQDF GSTVFNVPTK VLRYG YHVL HFITHALSAD GVQCYTPHSQ

MHV WA-VIDQDI GYTLFNVPTK VLRYG-- FHVL HFITHAFATD SVQCYTPHMQ

AIPV VTA--VIDGD GTVATGVPGF VSWVMDGVMF IHMTQTERKP WYIPTWFNRE IVGYTQDSII

SARS CoV VAA-IITREI GFIVPGLPGT VLRAIN GDFL HFLPRVFSAV GNICYTPSKL



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3185 3195 3205 3215 3225 3235

EMCR TS DK CIFNSACTRL EGLGGD-NVY CYN-TDLIEG SKPYSILQPN AYYKYDVKN-

229E TP EK CIFTSACTRL EGLGGN-NVY CYN-TALMEG SLPYSSIQAN AYYKYDNGN-

PEDV DK GA CIFNSACTTL SGLGGT-AVY CYK-NGLVEG AKLYSELAPH SYYKMVDGN-

TGEV VSKD-SYFDT CVFNTACTTL TGLGGT-IVY CAK-QGLVEG AKLYSDLMPD YYYEHASGN-

OC43 ISYSNFYASG CVLSSACTMF TMADGSPQPY CYT-EGLMQN ASLYSSLVPH VRYNLANAKG

BoCoV ISYSNFYASG CVLSSACTMF AMADGSPQPY CYT-DGLMQN ASLYSSLVPH VRYNLANAKG

MHV IPYDNFYASG CVLSSLCTML AHADGTPHPY CYT-EGIMHN ASLYDSLAPH VRYNLANSNG

AIPV TEG-SFYTSI ALFSARCLYL TASNTP-QLY CFNGDNDAPG ALPFGSIIPH RVYFQPNGVR

SARS CoV IEYSDFATSA CVLAAECTIF KDAMGKPVPY CYD-TNLLEG SISYSELRPD TRYVLMDGS-



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3245 3255 3265 3275 3285 3295

EMCR YVRFPEILAR GFGLRTIRTL ATRYCRVGEC RDSHKGVCFG FDKWYVNDGR VD DGYIC

229E FIKLPEVIAQ GFGFRTVRTI ATKYCRVGEC VESNAGVCFG FDKWFVNDGR VA NGYVC

PEDV AVSLPEIISR GFGIRTIRTK AMTYCRVGQC VQSAEGVCFG ADRFFVYNAE SG SDFVC

TGEV MVKLPAIIR- GLGLRFVKTQ ATTYCRVGEC IDSKAGFCFG GDNWFVYDNE FG NGYIC

OC43 FIRFPEVLRE GL-VRIVRTR SMSYCRVGLC EEADEGICFN FNGSWVLNND YYRSLPGTFC

BoCoV FIRLPEVLRE GL-VRIVRTR SMSYCRVGLC EEADEGICFN FNGSWVLNND YYRSLPGTFC

MHV YIRFPEWSE GI-VRIVRTR SMTYCRVGLC EDAEEGVCFN FNSSWVLNNP YYRAMPGTFC

AIPV LIVPQQILHT PY--WKFV SDSYCRGSVC EYTRPGYCVS LNPQWVLFND EYTSKPGVFC

SARS CoV IIQFPNTYLE GS-VRWTTF DAEYCRHGTC ERSEVGICLS TSGRWVLNNE HYRALSGVFC



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3305 3315 3325 3335 3345 3355

EMCR GDGLIDLLVN VLSIFSSSFS WAMSGHMLF NFLFAAFITF LCFLVTKFKR VFGDLSYGVF

229E GTGLWNLVFN ILSMFSSSFS VAAMSGQILL NCALGAFAIF CCFLVTKFRR MFGDLSVGVC

PEDV GTGLFTLLMN VISVFSKTVP VTVLSGOILF NCIIAFVAVA VCFLFTKFKR MFGDMSVGVF

TGEV GNSVLGFFKN VFKLFNSNMS WATSGAMLV NIIIACLAIA MCYGVLKFKK IFGDCTFLIV

OC43 GRDVFDLIYQ LFKGLAQPVD FLALTASSIA GAILAVIWL VFYYLIKLKR AFGDYTSWF

BoCoV GRDVFDLIYQ LFKGLAQPVD FLALTASSIA GAILAVIWL GFYYLIKLKR AFGDYTSIVF

MHV GRNAFDLIHQ VLGGLVRPID FFALTASSVA GAILAIIWL AFYYLIKLKR AFGDYTSWV

AIPV GSTVRELMFS MVSTFFTGVN -PNIYMQLAT MFLILWWL IFAMVIKFQG VFKAYATTVF

SARS CoV GVDAMKLIAN IFTPLVQPVG ALDVSASWA GGIIAILVTC AAYYFMKFRR VFGEYNHWA



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3365 3375 3385 3395 3405 3415

EMCR TWCATLINN ISYVVTQN-L FFMLLYAILY FVFTRTVR-- YAWIWHIAYI VAYFLLIPWW

229E TVWAVLLNN VSYIVTQN-L VTMIAYAILY FFATRSLR-- YAWIWCAAYL IAYISFAPWW

PEDV TVGACTLLNN VSYIVTQN-T LGMLGYATLY FLCTKGVR-- YMWIWHLGFL ISYILIAPWW

TGEV MIIVTLWNN VSYFVTQN-T FFMIIYAIVY YFITRKLA-- YPGILDAGFI IAYINMAPWY

OC43 VNVIVWCVNF MMLFVFQVYP ILSCVYAICY FYATLYFPSE ISVIMHLQWL VMYGTIMPLW

BoCoV VNVIVWCVNF MMLFVFQVYP TLSCVYAICY FYATLYFPSE ISVIMHLQWL VMYGTIMPLW

MHV INVIVWCINF LMLFVFQVYP TLSCLYAC FY FYTTLYFPSE ISVVMHLQWL VMYGAIMPLW

AIPV ITMLVWVINA FILCVHSYNS VLAVILLVLY CYASLVTSRN TVIIMHCWLV FTFGLIVPTW

SARS CoV ANALLFLMSF TILCLVPAYS FLPGVYSVFY LYLTFYFTND VSFLAHLQWF AMFSPIVPFW
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3425 3435 3445 3455 3465 3475

EMCR LLTWFSFAAF LELLPNVFKL K ISTQL FEGDKFIGTF ESAAAGTFVL DMRSYERLIN

229E LCAWYFLAML TGLLPSLLKL K VSTNL FEGDKFVGTF ESAAAGTFVI DMRSYEKLAN

PEDV VLMVYAFSAI FEFMPNLFKL K VSTQL FEGDKFVGSF ENAAAGTFVL DMHAYERLAN

TGEV VITAYILVFL YDSLPSLFKL K VSTNL FEGDKFVGNF ESAAMGTFVI DMRSYETIVN

OC43 FCLLYIAVW SN--HAFWVF S YCRKL GTSVRSDGTF EEMALTTFMI TKDSYCKLKN

BoCoV FCLLYISVW SN--HAFWVF S YCRQL GTSVRSDGTF EEMALTTFMI TKDSYCKLKN

MHV FCI IYVAVW SN--HALWLF S YCRKL GTEVRSDGTF EEMSLTTFMI TKESYCKLKN

AIPV LACCYLGFI I YMYTPLFLWC YGTTKNTRKL YDGNEFVGNY DLAAKSTFVI RGSEFVKLTN

SARS CoV ITAIYVFCIS LKHCHWFFNN Y LRKRV MFNGVTFSTF EEAALCTFLL NKEMYLKLRS
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3485 3495 3505 3515 3525 3535

EMCR T--ISPEKLK NYAASYNKYK YYSGSASEAD YRCACYAHLA KAMLDYAKDH N-DMLYSPPT

229E S--ISPEKLK SYAASYNRYK YYSGNANEAD YRCACYAYLA KAMLDFSRDH N-DILYTPPT

PEDV S--ISTEKLR QYASTYNKYK YYSGSASEAD YRLACFAHLA KAMMDYASNH N-DTLYTPPT

TGEV S--TSIARIK SYANSFNKYK YYTGSMGEAD YRMACYAHLG KALMDYSVNR T-DMLYTPPT

OC43 S--LSDVAFN RYLSLYNKYR YYSGKMDTAA YREAACSQLA KAMDTFTNNN GSDVLYQPPT

BoCoV S--LSDVAFN RYLSLYNKYR YYSGKMDTAA YREAACSQLA KAMDTFTNNN GSDVLYQPPT

MHV S--VSDVAFN RYLSLYNKYR YFSGKMDTAA YREAACSQLA KAMETFNHNN GNDVLYQPPT

AIPV E--I-GDKFE AYLSAYARLK YYSGTGSEQD YLQACRAWLA YALDQYR-NS GVEIVYTPPR

SARS CoV ETLLPLTQYN RYLALYNKYK YFSGALDTTS YREAACCHLA KALNDFS-NS GADVLYQPPQ
3545 3555 3565 3575 3585 3595

EMCR ISYN-STLOS GLKKMAQPSG CVERCWRVC YGSTVLNGVW LGDTVTCPRH VIAPS-TTVL 

229E VSYG-STLQA GLKKMAQPSG FVEKCWRVC YGNTVLNGLW LGDIVYCPRH VIASN-TTSA

PEDV VSYN-STLQA GLKKMAQPSG WEKCIVRVC YGNMALNGLW LGDIVMCPRH VIASS-TTST 

TGEV VSVN-STLQS GLRKMAQPSG LVEPCIVRVS YGNNVLNGLW LGDEVICPRH VIASD-TTRV 

OC43 ASVSTSFLQS GIVKMVNPTS KVEPCWSVT YGNMTLNGLW LDDKVYCPRH VICSASDMTN 

BoCoV ASVSTSFLQS GIVKMVNPTS KVEPCIVSVT YGNMTLNGLW LDDKVYCPRH VICSASDMTN 

MHV ASVTTSFLQS GIVKMVFPTS KVEPCWSVT YGNMTLNGLW LDDKVYCPRH VICSSADMTD 

AIPV YSIGVSRLQS GFKKLVSPSS AVEKCIVSVS YRGNNLNGLW LGDTIYCPRH VLG KFSG 

SARS CoV TSITSAVLQS GFRKMAFPSG KVEGCMVQVT CGTTTLNGLW LDDTVYCPRH VICTAEDMLN 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3605 3615 3625 3635 3645 3655 

EMCR IDYDHAYSTM RLHNFSVSHN G-VFLGWGV TMHGSVLRIK VSQSNVHTPK HVFKTLKPGA 

229E IOYDHEYSIM RLHNFSIISG T-AFLGWGA TMHGVTLKIK VSQTNMHTPR HSFRTLKSGE 

PEDV IDYDYALSVL RLHNFSISSG N-VFLGWSA TMRGALLQIK VNQNNVHTPK YTYRTVRPGE 

TGEV INYENEMSSV RLHNFSVSKN N-VFLGWSA RYKGVNLVLK VNQVNPNTPE KKFKSIKAGE 

OC43 PDYTNLLCRV TSSDFTVLFD R-LSLTVMSY QMRGCMLVLT VTLQNSRTPK YTFGWKPGE 

BoCoV PDYTNLLCRV TSSDFTVLFD R-LSLTVMSY QMQGCMLVLT VTLQNSRTPK YTFGWKPGE 

MHV PDYSNLLCRV ISSDFCVMSG R-MSLTVMSY QMQGSLLVLT VTLQNPNTPK YSFGWKPGE 

AIPV DQWNDVLNLA NNHEFEVTTQ HGVTLNWSR RLKGAVLILQ TAVANAETPK YKFIKANCGD 

SARS CoV PNYEDLLIRK SNHSFLVQAG N-VQLRVIGH SMQNCLLRLK VDTSNPKTPK YKFVRIQPGQ

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
3665 3675 3685 3695 3705 3715 

EMCR SFNILACYEG IASGVFGVNL RTNFTIKGSF INGACGSPGY NVRNDGTVEF CYLHQIELGS 

229E GFNILACYDG CAQGVFGVNM RTNWTIRGSF INGACGSPGY NLKN-GEVEF VYMHQIELGS 

PEDV SFNILACYDG AAAGVYGVNM RSNYTIRGSF INGACGSPGY NINN-GTVEF CYLHQLELGS 

TGEV SFNILACYEG CPGSVYGVNM RSQGTIKGSF IAGTCGSVGY VLEN-GILYF VYMHHLELGN 

OC43 TFTVLAAYNG KPQGAFHVTM RSSYTIKGSF LCGSCGSVGY VIMG-DCVKF VYMHQLELST 

BoCoV TFTVLAAYNG KPQGAFHVTM RSSYTIKGSF LCGSCGSVGY VIMG-DCVKF VYMHQLELST 

MHV TFTVLAAYNG KSQGAFHVTM RSSYTIKGSF LCGSCGSVGY VLTG-DSVRF VYMHQLELST 

AIPV SFTIACAYGG TWGLYPVTM RSNGTIRASF LAGACGSVGF NIEK-GWNF FYMHHLELPN 

SARS CoV TFSVLACYNG SPSGVYQCAM RPNHTIKGSF LNGSCGSVGF NIDY-DCVSF CYMHHMELPT 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
3725 3735 3745 3755 3765 3775 

EMCR GAHVGSDFTG SVYGNFDDQP SLQVESANLM LSDNWAFLY AALLNGCR WWLRST 

229E GSHVGSSFDG VMYGGFEDQP NLQVESANQM LTVNWAFLY AAILNGCT WWLKGE

PEDV GCHVGSDLDG VMYGGYEDQP TLQVEGASSL FTENVLAFLY AALINGST WWLSSS 

TGEV GSHVGSNFEG EMYGGYEDQP SMQLEGTNVM SSDNWAFLY AALINGER WFVTNT 

OC43 GCHTGTDFNG DFYGPYKDAQ WQLLIQDYI QSVNFVAWLY AAILNNCN WFVQSD 

BoCoV GCHTGTDFNG DFYGPYKDAQ WQLPVQOYI QSVNFVAWLY AAILNNCN WFVQSD 

MHV GCHTGTDFSG NFYGPYRDAQ WQLPVQDYT QTVNVVAWLY AAILNRCN WFVQSD 

AIPV ALHTGTDLMG EFYGGYVDEE VAQRVPPDNL VTNN IVAWLY AAIISVKESS FSLPKWLEST 

SARS CoV GVHAGTDLEG KFYGPFVDRQ TAQAAGTDTT ITLNVLAWLY AAVINGDR WFLNRF
EMCR RVNVDGFNEW AMANGYTIVS SV--ECYSIL AAKTGVSVEQ LLASIQHLHE -GFGGKNILG

229E KLFVEHYNEW AQANGFTAMN GE--DAFSIL AAKTGVCVER LLHAIQVLNN -GFGGKQILG

PEDV RIAVDRFNEW AVHNGMTTVG NT--DCFSIL AAKTGVDVQR LLASIQSLHK -NFGGKQILG

TGEV SMSLESYNTW AKTNSFTELS ST--DAFSML AAKTGQSVEK LLDSIVRLNK -GFGGRTILS

OC43 KCSVEDFNVW ALSNGFSQVK SD--LVIDAL ASMTGVSLET LLAAIKRLKN -GFQGRQIMG

BoCoV KCSVEDFNVW ALSNGFSQVK SD--LVIDAL ASMTGVSLET LLAAIKRLKN -GFQGRQIMG

MHV SCSLEEFNVW AMTNGFSSIK AD--LVLDAL ASMTGVTVEQ ILAAIKRLYS -GFQGKQILG

AIPV TVSVDDYNKW AGDNGFTPFS TS--TAITKL SAITGVDVCK LLRTIMVKNS -QWGGDPILG

SARS CoV TTTLNDFNLV AMKYNYEPLT QDHVDILGPL SAQTGIAVLD MCAALKELLQ NGMNGRTILG 
EMCR YSSLCDEFTL AEVVKQMYGV NLQSGK V I FGLKTMFLF SVFFTMFWAE LFIYTNTIWI

229E YSSLNDEFSI NEVVKQMFGV NLQSGK T TSMFKSISLF AGFFVMFWAE LFVYTTTIWV 

PEDV HTSLTDEFTT GEWRQMYGV NLQGGY V SRACRNVLLV GSFLTFFWSE LVSYTKFFWV 

TGEV YGSLCDEFTP TEVIRQMYGV NLQAGK V KSFFYPIMTA MTILFAFWLE FFMYTPFTWI

OC43 SCSFEDELTP SDVYQQLAGI KLQSKRTRLF KGTVCWIMAS TFLFSCIITA FVKWTMFMYV 

BoCoV SCSFEDELTP SDVYQQLAGI KLQSKRTRLV KGIVCWIMAS TFLFSCIITA FVKWTMFMYV 

MHV SCVLEDELTP SDVYQQLAGV KLQSKRTRW KGTCCWILAS TLLFCSIISA FVKWTMFMYV 

AIPV QYNFEDELTP ESVFNQIGGV RLQSSFVR— K — ATSWFWS RCVLACFLFV LCAIVLFTAV 

SARS CoV STILEDEFTP FDVVRQCSGV TFQGKFKKIV KGTHHWMLLT FLTSLLILVQ STQWSLFFFV 
EMCR NPVILTPIFC LLLFLSLVLT MFLKHKFLFL QVFLLPTVIA TALYNC-VLD YYIVKFLADH
229E NPGFLTPFMI LLVALSLCLT FVVKHKVLFL QVFLLPSIIV AAIQNC-AWD YHVTKVLAEK 
PEDV NPGYVTPMFA CLSLLSSLLM FTLKHKTLFF QVFLIPALIV TSCINL-AFD VEVYNYLAEH 
TGEV NPTFVSIVLA VTTLISTVFV SGIKHKMLFF MSFVLPSVIL VTAHNL-FWD FSYYESLQSI 
OC43 TTNMFSITFC ALCVIS-LAM LLVKHKHLYL TMYITP-VLF TLLYNN-YLV VYKHTFRGYV 
BoCoV TTNMLSITFC ALCVIS-LAM LLVKHKHLYL TMYIIP-VLF TLLYNN-YLV VYKQTFRGYV 
MHV TTHMLGVTLC ALCFVS-FAM LLVKHKHLYL TMFIMP-VLC TLFYTN-YLV VYKQSFRGLA 
AIPV PLKFYVYAAV ILLMAVLFIS FTVKHVMAYM DTFLLPTLIT VIIGVCAEVP FIYNTLISQV 
SARS CoV YENAFLPFTL GIMAIAACAM LLVKHKHAFL CLFLLPSLAT VAYFN MV YMPASWVMRI 
LHNKINLCDD PEKAQGMLLA LLAFFLSKHS DFG L DGLIDSYFDN SSTLQSVASS

MHNKINLCDD PETAQELLLA LLAFFLSKHS DFG L GDLVDSYFEN DSILQSVASS 

LHNKINLCND PEKAQEMLLA LLAFFLSKNS AFG L DDLLESYFND NSMLQSVAST 

LHNEINLCDD PEIVLEKLLA LIAFFLSKHN TCD L SELIESYFEN TTILQSVASA 

LHNEILATSD LSVAFEKLAQ LLIVLFANPA AVDSKCLTSI EEVCDDYAKD NTVLQALQSE 

LHNEILATSD LGVAFEKLAQ LLIVLFANPA AVDSKCLTSI EEVCDDYAKD NTVLQALQSE 
FVSMPSYIAY ENARQAYEDA IANGSS SQLIKQLKRA MNIAKSEFDH EISVQKKINR

FVGMPSFVAY ETARQEYENA VANGSS PQIIKQLKKA MNVAKAEFDR ESSVQKKINR 

YVGLPSYVIY ENARQQYEDA VNNGSP PQLVKQLRHA MNVAKSEFDR EASTQRKLDR 

YAALPSWIAL EKARADLEEA KKNDVS PQILKQLTKA FNIAKSDFER EASVQKKLDK 

FVNMASFVEY EVAKKNLDEA RFSGSAN QQQLKQLEKA CNIAKSAYER DRAVAKKLER 

FVNMASFVEY EVAKKNLDEA CSSGSAN— QQQLKQLEKA CNIAKSAYER DRAVARKLER 

FVNMASFVEY ELAKKNLDEA KASGSAN QQQIKQLEKA CNIAKSAYER DRAVARKLER 

FSHIPSYAEY ERAKNLYEKV LVDSKNGGVT QQELAAYRKA ANIAKSVFDR DLAVQKKLDS 

FSSLPSYAAY ATAQEAYEQA VANGDS EVVLKKLKKS LNVAKSEFDR DAAMQRKLEK 
EMCR ATSASKLTIV SPDLESYSKI VCDGSVHYAG WWTLNDVKD NDGRPVHVKE ITR EN

229E ATSAARLVW VPDHDSFVKM MVDGFVHYAG VVWTLQEVKD NDGKNVHLKD VTK EN

PEDV AVSATKLNIV TSDIDSYNRI QREGCVHYAG TIWNIIDIKD NDGKWHVKE VTA QN 

TGEV AASATRLWI TPSLEVFSKI RQENNVHYAG AIWTIVEVKD ANGSHVHLKE VTA AN 

OC43 SLAANTLNII VPDKSVYDQV VDNVYVTYAG NVWQIQTIQD SDGTNKQLNE IS 

BoCoV SLAANTLTII VPDKSVYDQV VDNVYVTYAG NVWQIQTIQD SDGTNKQLHE IS

MHV SLTSNTLTII VPDKQVFDQV VDNVYVTYAG NVWHIQSIQD ADGAVKQLNE ID 

AIPV IVCSNKLTLV IPDPETWVKC VEGVHVTYST WWNIDTVID ADGTELHPTS TGSGLTYCIS 
SARS CoV LTTAAKLMW VPDYGTYKNT CDGNTFTYAS ALWEIQQWD ADSKIVQLSE INM DN
EMCR VETLTWPLIL NCER WKLQNNEIM PGKLKQKPMK AEG--DGGVL GDGNALYNTE

229E QEILVWPLIL TCER WKLQNNEIM PGKMKVKATK GEG--DGGIT SEGNALYNNE

PEDV AESLSWPLVL GCER IVKLQNNEII PGKLRQRSIK AEG--DG-IV GEGKAXYNNE

TGEV ELNLTWPLSI TCER TTKLQNNEIM PGKLKERAVR ASATLDGEAF GSGKALMASE

OC43 -DDCNWPLVI IANRY-NEVS ATVLQNNELM PAKLKIQVVN SGP--OQTCN TPTQCYYNNS

BoCoV -DDCNWPLVI IANRH-NEVS ATVLQNNELM PAKLKTQWN SGP--DQTCN TPTQCYYNNS

MHV -VNITWPLVI AANRH-NEVS SWLQNNELM PQKLRTQWN SGS--DMNCN TPTQCYYNTT

AIPV GANIAWPLKV NLTRNGHNKV DWLQNNELM PHGVKTKACV AGVD-QAHCS VESKCYYTNI 

SARS CoV SPNLAWPLIV TALRA-N-- S AVKLQNNELS PVALRQMSCA AGTTQTACTD DNALAYYNNS 
VIDTPTGPQI
LVDS PNGAQI 
YVDGANGPEV 
TVQDAKGLKI 
VTDTPKGPKV
4555
KYLYFVKNLN 
KYLYFVKNLN 
KYLYFVRNLN 
KYLYFVKNLN 
KYLYFVKGCN 
KYLYFVKGCN 
KYLYFVKGCN 
VYLYFIKNTR 
EMCR TLRRGAVLGF IGATIRLQAG -KQTELAVNS GLLTACAFSV DPATTYLEAV KHGAKPVSNC

229E NLRRGAVLGY IGATVRLQAG -KQTEFVSNS HLLTHCSFAV DPAAAYLDAV KQGAKPVGNC

PEDV TLRRGAVLGY IGATVRLQAG -KQTEQAINS SLLTLCAFAV DPAKTYIDAV KSGHKPVGNC 

TGEV TLRRGAVLGY IGATVRLQAG -KPTEHPSNS SLLTLCAFSP DPAKAYVDAV KRGMQPVNNC 

OC43 TLARGWVVGT ISSTVRLQAG -TATEYASNS SILSLCAFSV DPKKTYLDFI QQGGTPIANC 

BoCoV TLARGWVVGT ISSTVRLQAG -TATEYASNS SILSLCAFSV DPKKTYLDFI QQGGTPIANC 

MHV TLARGWVVGT LSSTVRLQAG -TATEYASNS AIRSLCAFSV DPKKTYLDYI QQGGAPVTNC 

AIPV SIVRGMVLGA ISNVWLQSK GHETEEVDAV GILSLCSFAV DPADTYCKYV AAGNQPLGNC 

SARS CoV NLNRGMVLGS LAATVRLQAG -NATEVPANS TVLSFCAFAV DPAKAYKDYL ASGGQPITNC 
EMCR IKMLSNGAGN GQAITTSVDA NTNQDSYGGA SICLYCRAHV PHP SMD GYCKFKGKCV

229E VKMLTNGSGS GQAITCTIDS NTTQDTYGGA SVCIYCRAHV AHP TMD GFCQYKGKWV 

PEDV VKMLANGSGN GQAVTNGVEA STNQDSYGGA SVCLYCRAHV EHP SMD GFCRLKGKYV 

TGEV VKMLSNGAGN GMAVTNGVEA NTQQDSYGGA SVCIYCRCHV EHP AID GLCRYKGKFV 

OC43 VKMLCDHAGT GMAITVKPDA TTSQDSYGGA SVCIYCRARV EHP DVD GLCKLRGKFV 

BoCoV VKMLCDHAGT GMAITVKPDA TTSQDSYGGA SVCIYCRARV EHP DVD GLCKLRGKFV

MHV VKMLCDHAGT GMAITIKPEA TTNQDSYGGA SVCIYCRSRV EHP DVD GLCKLRGKFV 

AIPV VKMLTVHNGS GFAITSKPSP TPDQDSYGGA SVCLYCRAHI AHPGSVGNLD GRCQFKGSFV 

SARS CoV VKMLCTHTGT GQAITVTPEA NMDQESFGGA SCCLYCRCHI DHP NPK GFCDLKGKYV
EMCR QVPIGCL-DP IRFCLENNVC NVCGCWLGHG CACDRTTIQS VDISYLNEQ

229E QVPIGTN-DP IRFCLENTVC KVCGCWLNHG CTCDRTAIQS FDNSYLNES 

PEDV QVPLGTV-DP IRFVLENDVC KVCGCWLSNG CTCDRSIMQS T 

TGEV QIPTGTQ-DP IRFCIENEVC WCGCWLNNG CMCDRTSMQS F TVDQSYLNEC 

OC43 QVPVGIK-DP VSYVLTHDVC RVCGFWRDGS CSCVSTDTTV Q-- SKDT 

BoCoV QVPVGIK-DP VSYVLTHDVC QVCGFWRDGS CSCVSTDTTV Q SKDTNFLNGF

MHV QVPLGIK-DP VSYVLTHDVC QVCGFWRDGS CSCVGTGSQF Q SKDTNFLNGF 

AIPV QIPTTEK-DP VGFCLRNKVC TVCQCWIGYG CQCDSLRQPK SSVQSVAGAS DFDKNYLNGY 

SARS CoV QIPTTCANDP VGFTLRNTVC TVCGMWKGYG CSCDQLREPL M QSADAST FLN 
EMCR RARGSSAARL EPCN-GTDID KCVRAFDIYN KNVSFLGKCL

229E EPCN-GTDID YCVRAFDVYN KDASFIGKNL

PEDV YGLFK RVRGSSAARL EPCN-GTDTQ HVYRAFDIYN KDVACLGKFL 

TGEV EPCN-GTDPD HVSRAFDIYN KDVACIGKFL 

BoCoV FFKR VRGTSVDARL VPCASGLSTD VQLRAFDICN ASVAGIGLHL 

OC43 FFKR VRGTSVDARL VPCASGLSTD VQLRAFDIYN ASVAGIGLHL 

MHV LFLCRHRLPV SVKRHELFKR VRGTSVNARL VPCASGLDTD VQLRAFDICN ANRAGIGLYY 

AIPV MFQNL 

SARS CoV TPCGTGTSTD WYRAFDIYN EKVAGFAKFL 
EMCR KMNCVRFKNA DL KDGYFVIKRC TKSVMEHEQS MYNLLNFSGA LAEHDFFTWK

229E KSNCVRFKNV DK DDAFYIVKRC IKSVMDHEQS MYNLLKGCNA VAKHDFFTWH 

PEDV KVNCVRLKNL DK HDAFYVVKRC TKSAMEHEQS IYSRLEKCGA IAEHDFFTWK 

TGEV KTNCSRFRNL DK HDAYYIVKRC TKTVMDKEQV CYNDLKDSGA VAEHDFFTYK 

BoCoV KVNCCRFQRV --DENG--DK LDQFFWKRT DLTIYNREME CYERVKDCKF VAEHDFFTFD

OC43 KVNCCRFQRV --DENG--DK LDQFFWKRT DLTIYNREMK CYERVKDCKF VAEHDFFTFD

MHV KVNCCRFQRA --DEDG--NT LDKFFVIKRT NLEVYNKEKE CYELTKECGV VAEHEFFTFD

AIPV KRNCARFQEL RDTEDGNLEY LDSYFVVKQT TPSNYEHEKS CYEDLKS-EV TADHDFFVFN

SARS CoV KTNCCRFQEK --DEEG--NL LDSYFVVKRH TMSNYQHEET IYNLVKDCPA VAVHDFFKFR
EMCR DGRVIYGNVS RHNLTKYTMM DLVYAMRNFD EQNCDVLKEV LVLTGCCDNS YFDSKG

229E EGRTIYGNVS RQDLTKYTMM DLCFALRNFD EKDCEVFKEI LVLTGCCSTD YFEMKN 

PEDV DGRAIYGNVC RKDLTEYTMM DLCYALRNFD ENNCDVLKSI LIKVGACEES YFNNKV 

TGEV EGRCEFGNVA RRNLTKYTMM DLCYAIRNFD EKNCEVLKEI LVTVGACTEE FFENKD 

BoCoV VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS YFTKKD 

OC43 VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS YFTKKD 

MHV VEGSRVPHIV RKDLSKYTML DLCYALRHFD RNDCSTLKEI LLTYAECDES YFQKKD 

AIPV KN IYNIS RQRLTKYTMM DFCYALRHFD PKDCEVLKEI LVTYGCIEDY HPKWFEENKD 

SARS CoV VDGDMVPHIS RQRLTKYTMA DLVYALRHFD EGNCDTLKEI LVTYNCCDDD YFNKKD 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
185 195 205 215 225 235

EMCR WYDPVENEDI HRVYASLGKI VARAMLKCVA LCDAMVAKGV VGVLTLDNQD LNGNFYDFGD 

229E WFDPIENEDI HRVYAALGKV VANAMLKCVA FCDEMVLKGV VGVLTLDNQD LNGNFYDFGD 

PEDV WFDPVENEDI HRVYALLGTI VARAMLKCVK FCDAMVEQGI VGWTLDNQD LNGDFYDFGD 

TGEV WFDPVENEAI HEVYAKLGPI VANAMLKCVA FCDAIVEKGY IGVITLDNQD LNGNFYDFGD 

BoCoV WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGILTLDNQD LNGKWYDFGD

OC43 WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGVLTLDNQD LNGKWYDFGD 

MHV WYDFVENSDI INVYKKLGPI FNRALLNTAK FADTLVEAGL VGVLTLDNQD LYGQWYDFGD 

AIPV WYDPIENSKY YVMLAKMGPI VRRALLNAIE FGNLMVEKGY VGVITLDNQD LNGKFYDFGD

SARS CoV WYDFVENPDI LRVYANLGER VRQSLLKTVQ FCDAMRDAGI VGVLTLDNQD LNGNWYDFGD

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

245 255 265 275 285 295 

EMCR FVVSLPNMGV PCCTSYYSYM MPIMGLTNCL ASECFVKSDI FGSDFKTFDL LKYDFTEHKE 

229E FVLCPPGMGI PYCTSYYSYM MPVMGMTNCL ASECFMKSDI FGQDFKTFDL LKYDFTEHKE

PEDV FTCSIKGMGV PICTSYYSYM MPVMGMTNCL ASECFVKSDI FGEDFKSYDL LEYDFTEHKT 

TGEV FVKTAPGFGC ACVTSYYSYM MPLMGMTSCL ESENFVKSDI YGSDYKQYDL LAYDFTEHKE 

BoCoV YVIAAPGCGV AIADSYYSYM MPMLTMCHAL DCELYVNNAY R LFDL VQYDFTDYKL

OC43 YVIAAPGCGV AIADSYYSYI MPMLTMCHAL DCELYVNNAY R LFDL VQYDFTDYKL

MHV FVKTVPGCGV AVADSYYSYM MPMLTMCHAL DSELFINGTY R EFDL VQYDFTDFKL 

AIPV FQKTAPGAGV PVFDTYYSYM MPIIAMTDAL APERYFEYDV HKG-YKSYDL LKYDYTEEKQ 

SARS CoV FVQVAPGCGV PIVDSYYSLL MPILTLTRAL AAESHMDADL AKP-LIKWDL LKYDFTEERL 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

305 315 325 335 345 355 

EMCR NLFNKYFKHW SFDYHPNCSD CYDDMCVIHC ANFNTLFATT IPGTAFGPLC RKVFIDGVPL 

229E VLFNKYFKYW GQDYHPDCVD CHDEMCILHC SNFNTLFATT IPNTAFGPLC RKVFIDGVPV 

PEDV ALFNKYFKYW GLQYHPNCVD CSDEQCIVHC ANFNTLFSTT IPITAFGPLC RKCWIDGVPL 

TGEV YLFQKYFKYW DRTYHPNCSD CTSDECIIHC ANFNTLFSMT IPMTAFGPLV RKVHIDGVPV 

BoCoV ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV RQIFVDGVPF 

OC43 ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV RQIFVDGVPF 

MHV ELFNKYFKYW SMTYHPNTCE CEDDRCIIHC ANFNILFSMV LPKTCFGPLV RQIFVDGVPF 

AIPV ELFQKYFKYW DQEYHPNCRD CSDDRCLIHC ANFNILFSTL IPQTSFGNLC RKVFVDGVPF

SARS CoV CLFDRYFKYW DQTYHPNCIN CLDDRCILHC ANFNVLFSTV FPPTSFGPLV RKIFVDGVPF 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

365 375 385 395 405 415 

EMCR VTTAGYHFKQ LGLVWNKDVN THSVRLTITE LLQFVTDPSL IIASSPALVD QRTICFSVAA 

229E VATAGYHFKQ LGLVWNKDVN THSTRLTITE LLQFVTDPTL IVASSPALVD KRTVCFSVAA

PEDV VTTAGYHFKQ LGIVWNNDLN LHSSRLSINE LLQFCSDPAL LIASSPALVD QRTVCFSVAA 

TGEV WTAGYHFKQ LGIVWNLDVK LD7MKLSMTD LLRFVTDPTL LVASSPALLD QRTVCFSIAA 

BoCoV WSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD LRTCCFSVAA 

OC43 WSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD LRTCCFSVAA 

MHV WSIGYHYKE LGWMNMDVD THRYRLSLKD LLLYAADPAL HVASASALLD LRTCCFSVAA 

AIPV IATCGYHSKE LGVIMNQDNT MSFSKMGLSQ LMQFVGDPAL LVGTSNNLVD LRTSCFSVCA 

SARS CoV WSTGYHFRE LGWHNQDVN LHSSRLSFKE LLVYAADPAM HAASGNLLLD KRTTCFSVAA 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

425 435 445 455 465 475 

EMCR LSTGLTNQW KPGHFNEEFY NFLRLRGFFD EGSELTLKHF FFAQNGDAAV KDFDFYRYNK 

229E LSTGLTSQTV KPGHFNKEFY DFLRSQGFFD EGSELTLKHF FFTQKGDAAI KDFDYYRYNR 

PEDV LGTGMTNQTV KPGHFNKEFY DFLLEQGFFS EGSELTLKHF FFAQKVDAAV KDFDYYRYNR 

TGEV LSTGITYQTV KPGHFNKDFY DFITERGFFE EGSELTLKHF FFAQGGEAAM TDFNYYRYNR 

BoCoV ITSGVKFQTV KPGNFNQDFY DFILSKGLLK EGSSVDLKHF FFTQDGNAAI TDYNYYKYNL

OC43 ITSGVKFQTV KPGNFNQDFY DFVLSKGLLK EGSSVDLKHF FFTQDGNAAI TDYNYYKYNL 

MHV ITSGVKFQTV KPGNFNQDFY EFILSKGLLK EGSSVDLKHF FFTQDGNAAI TDYNYYKYNL 

AIPV LTSGITHQTV KPGHFNKDFY DFAEKAGMFK EGSSIPLKHF FYPQTGNAAI NDYDYYRYNR 

SARS CoV LTNNVAFQTV KPGNFNKDFY DFAVSKGFFK EGSSVELKHF FFAQDGNAAI SDYDYYRYNL 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

485 495 505 515 525 535 

EMCR PTILDICQAR VTYKIVSRYF DIYEGGCIKA CEVWTNLNK SAGWPLNKFG KASLYYESIS 

229E PTMLDIGQAR VAYQVAARYF DCYEGGCITS REVWTNLNK SAGWPLNKFG KAGLYYESIS 

PEDV PTVLDICQAR WYQIVQRYF DIYEGGCITA KEVWTNLNK SAGYPLNKFG KAGLYYESLS 

TGEV VTVLDICQAQ FVYKIVGKYF ECYDGGCINA REVWTNYDK SAGYPLNKFG KARLYYETLS 

BoCoV PTMVDIKQLL FVLEWYKYF EIYDGGCIPA AQVIVNNYDK SAGYPFNKFG KARLYYEALS 

OC43 PTMVDIKQLL FVLEWYKYF EIYDGGCIPA SQVIVNNYDK SAGYPFNKFG KARLYYEALS 

MHV PTMVDIKQLL FVLEVVNKYF EIYDGGCIPA TQVIVNNYDK SAGYPFNKFG KARLYYEALS 

AIPV PTMFDICQLL FCLEVTSKYF ECYEGGCIPA SQVWNNLDK SAGYPFNKFG KARLYYE-MS 

SARS CoV PTMCDIRQLL FWEWDKYF DCYDGGCINA NQVIVNNLDK SAGFPFNKWG KARLYYDSMS 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

545 555 565 575 585 595 

EMCR YEEQDALFAL TKRNVLPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ YHQKHLKSIV 

229E YEEQDAIFSL TKRNILPTMT QLNLKYAISG KERARTVGGV SLLATMTTRQ FHQKCLKSIV 

PEDV YEEQDELYAY TKRNILPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ YHQKHLKSIV 

TGEV YEEQDALFAL TKRNVLPTMT QMNLKYAISG KARARTVGGV SLLSTMTTRQ YHQKHLKSIA 

BoCoV FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKSIA

OC43 FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKSIA 

MHV FEEQDEVYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM FHQKCLKSIA 

AIPV LEEQDQLFEI TKKNVLPTIT QMNLKYAISA KNRARTVAGV SILSTMTNRQ FHQKILKSIV 

SARS CoV YEDQDALFAY TKRNVIPTIT QMNLKYAISA KNRARTVAGV SICSTMTNRQ FHQKLLKSIA 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

605 615 625 635 645 655 

EMCR NTRNATWIG TTKFYGGWNN MLRTLIDGVE NPMLMGWDYP KCDRALPNMI RMISAMVLGS 

229E ATRNATVVIG TTKFYGGWDN MLKNLMADVD DPKLMGWDYP KCDRAMPSMI RMLSAMILGS

PEDV NTRGASWIG TTKFYGGWDN MLKNLIDGVE NPCLMGWDYP KCDRALPNMI RMISAMILGS 

TGEV ATRNATVVIG STKFYGGWDN MLKNLMRDVD NGCLMGWDYP KCDRALPNMI RMASAMILGS

BoCoV ATRGVPWIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNIL RIVSSLVLAR 

OC43 ATRGVPWIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNLL RIVSSLVLAR 

MHV ATRGVPWIG TTKFYGGWDD MLRRLIKDVD SPVLMGWDYP KCDRAMPNIL RIISSLVLAR 

AIPV NTRNASWIG TTKFYGGWDN MLRNLIQGVE DPILMGWDYP KCDRAMPNLL RIAASLVLAR 

SARS CoV ATRGATWIG TSKFYGGWHN MLKTVYSDVE TPHLMGWDYP KCDRAMPNML RIMASLVLAR 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

665 675 685 695 705 715 

EMCR KHVNCCTVTD RFYRLGNELA QVLTEWYSN GGFYFKPGGT TSGDASTAYA NSIFNIFQAV 

229E KHVTCCTASD KFYRLSNELA QVLTEWYSN GGFYFKPGGT TSGDATTAYA NSVFNIFQAV

PEDV KHTTCCSSTD RFFRLCNELA QVLTEWYSN GGFYLKPGGT TSGDATTAYA NSVFNIFQAV 

TGEV KHVGCCTHND RFYRLSNELA QVLTEWHCT GGFYFKPGGT TSGDGTTAYA NSAFNIFQAV

BoCoV KHEACCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA NSVFNICQAV

OC43 KHETCCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA NSVFNICQAV 

MHV KHDSCCSHTD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA NSVFNICQAV 

AIPV KHTNCCSWSE RIYRLYNECA QVLSETVLAT GGIYVKPGGT SSGDATTAYA NSVFNIIQAT 

SARS CoV KHNTCCNLSH RFYRLANECA QVLSEMVMCG GSLYVKPGGT SSGDATTAYA NSVFNICQAV

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
725 735 745 755 765 775 

EMCR SSNINRLLSV PSDSCNNVNV RDLQRRLYDN CYRLTSVEES FIDDYYGYLR KHFSMMILSD 

229E SSNINCVLSV NSSNCNNFNV KKLQRQLYDN CYRNSNVDES FVDDFYGYLQ KHFSMMILSD

PEDV SANVNKLLSV DSNVCHNLEV KQLQRKLYEC CYRSTIVDDQ FWEYYGYLR KHFSMMILSD 

TGEV SANVNKLLGV DSNACNNVTV KSIQRKIYDN CYRSSSIDEE FVVEYFSYLR KHFSMMILSD 

BoCoV SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDMVDST FVTEYYEFLN KHFSMMILSD

OC43 SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDKVDST FVTEYYEFLN KHFSMMILSD 

MHV SANVCSLMAC NGHKIEDLSI RELQKRLYSN VYRADHVDPA FVNEYYEFLN KHFSMMILSD 

AIPV SANVARLLSV ITRDIVYONI KSLQYELYQQ VYRRVNFDPA FVEKFYSYLC KNFSLMILSD 

SARS CoV TANVNALLST DGNKIADKYV RNLQHRLYEC LYRNRDVDHE FVDEFYAYLR KHFSMMILSD 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

785 795 805 815 825 835 

EMCR DGWCYNKDY AELGYIADIS AFKATLYYQN NVFMSTSKCW VEEDLTKGPH EFCSQHTMQI 

229E DSWCYNKTY AGLGYIADIS AFKATLYYQN GVFMSTAKCW TEEDLSIGPH EFCSQHTMQI

PEDV DGWCYNNDY ASLGYVADLN AFKAVLYYQN NVFMSASKCW IEPDINKGPH EFCSQHTMQI 

TGEV DGWCYNKDY ADLGYVADIN AFKATLYYQN NVFMSTSKCW VEPDLSVGPH EFCSQHTLQI 

BoCoV DGWCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VENDINNGPH EFCSQHTMLV 

OC43 DGWCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VEHDINNGPH EFCSQHTMLV 

MHV DGWCYNSEF ASKGYIANIS AFQQVLYYQN NVFMSEAKCW VETDIEKGPH EFCSQHTMLV 

AIPV DGWCYNNTL AKQGLVADIS GFREVLYYQN NVFMADSKCW VEPDLEKGPH EFCSQHTMLV 

SARS CoV DAWCYNSNY AAQGLVASIK NFKAVLYYQN NVFMSEAKCW TETDLTKGPH EFCSQHTMLV

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

845 855 865 875 885 895 

EMCR VDKDGTYYLP YPDPSRILSA GVFVDDWKT DAWLLXRYV SLAIDAYPLS KHPNSEYRKV 

229E VDENGKYYLP YPDPSRIISA GVFVDDITKT DAVILLERYV SLAIDAYPLS KHPKPEYRKV

PEDV VDKEGTYYLP YPDPSRILSA GVFVDDWKT DAVVLLERYV SLAIDAYPLS KHENPEYKKV 

TGEV VGPDGDYYLP YPDPSRILSA GVFVDDIVKT DNVIMLERYV SLAIDAYPLT KHPKPAYQKV 

BoCoV KMDGDDVYLP YPVPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV YHEKEEYQKV 

OC43 KMDGDDVYLP YPNPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV YHENEEYQKV 

MHV KMDGDEVYLP YPDPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV YHENPEYQNV 

AIPV EVDGEPKYLP YPDPSRILGA CVFVDDVDKT EPVAVMERYI ALAIDAYPLV HHENEEYKKV 

SARS CoV KQGDDYVYLP YPDPSRILGA GCFVDDIVKT DGTLMIERFV SLAIDAYPLT KHPNQEYADV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

905 915 925 935 945 955

EMCR FYVLLDWVKH LNKNLNEGVL ESFSVTLLDN QEDKFWCEDF YASMYENSTI LQAAGLCVVC 

229E FYALLDWVKH LNKTLNEGVL ESFSVTLLDE HESKFWDESF YASMYEKSTV LQAAGLCWC 

PEDV FYVLLDWVKH LYKTLNAGVL ESFSVTLLED STAKFWDESF YANMYEKSAV LQSAGLCVVC 

TGEV FYTLLDWVKH LQKNLNAGVL DSFSVTMLEE GQDKFWSEEF YASLYEKSTV LQAAGMCVVC 

BoCoV FRVYLEYIKK LYNELGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV MQSVGACWC

OC43 FRVYLAYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV MQSVGACWC 

MHV FRVYLEYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDETF YKNMYLRSAV MQSVGACWC 

AIPV FFVLLAYIRK LYQELSQNML MDYSFVMDID KGSKFWEQEF YENMYRAPTT LQSCGVCWC 

SARS CoV FHLYLQYIRK LHDELTGHML DMYSVMLTND NTSRYWEPEF YEAMYTPHTV LQAVGACVLC 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

965 975 985 995 1005 1015 

EMCR GSQTVLRCGD CLRKPMLCTK CAYDHVFGTD HKFILAITPY VCNASGCGVS DVKKLYLGGL 

229E GSQTVLRCGD CLRRPMLCTK CAYDHVFGTD HKFILAITPY VCNTSGCNVN DVTKLYLGGL 

PEDV GSQTVLRCGD CLRRPMLCTK CAYDHVIGTT HKFILAITPY VCCASDCGVN DVTKLYLGGL 

TGEV GSQTVLRCGD CLRRPLLCTK CAYDHVMGTK HKFIMSITPY VCSFNGCNVN DVTKLFLGGL 

BoCoV SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN DVTKLYLGGM 

OC43 SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN DVTKLYLGGM 

MHV SSQTSLRCGS CIRKPLLCCK CAYDHVMSTD HKYVLSVSPY VCNSPGCDVN DVTKLYLGGM 

AIPV NSQTILRCGN CIRKPFLCCK CCYDHVMHTD HKNVLSINPY ICSQLGCGEA DVTKLYLGGM 

SARS CoV NSQTSLRCGA CIRRPFLCCK CCYDHVISTS HKLVLSVNPY VCNAPGCDVT DVTQLYLGGM

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
1025 1035 1045 1055 1065 1075 

EMCR NYYCTNHKPQ LSFPLCSAGN IFGLYKNSAT GSLDVEVFNR LATSDWTDVR DYKLANDVKD 

229E NYYCVDHKPH LSFPLCSAGN VFGLYKSSAL GSMDIDVFNK LSTSDWSDIR DYKLANDAKE

PEDV SYWCHEHKPR LAFPLCSAGN VFGLYKNSAT GSPDVEDFNR IATSDWTOVS DYRLANDVKD 

TGEV SYYCMNHKPQ LSFPLCANGN VFGLYKSSAV GSEAVEDFNK LAVSDWTNVE DYKLANNVKE 

BoCoV SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIDDFNR IASCKWTDVD DYILANECTE 

OC43 SYYCEDHKPQ YSFKLVMNGL VFGLYKQSCT GSPYIDDFNR IASCKWTDVD DYILANECTE 

MHV SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIEDFNK IASCKWTEVD DYVLANECTE 

AIPV SYFCGNHKPK LSIPLVSNGT VFGIYRANCA GSENVDDFNQ LATTNWSIVE PYILANRCSD 

SARS CoV SYYCKSHKPP ISFPLCANGQ VFGLYKNTCV GSDNVTDFNA IATCDWTNAG DYILANTCTE 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
1085 1095 1105 1115 1125 1135 

EMCR TLRLFAAETI KAKEESVKSS YAFATLKEW GPKELLLSWE SGKVKPPLNR NSVFTCFQIS 

229E SLRLFAAETV KAKEESVKSS YAYATLKEIV GPKELLLLWE SGKAKPPLNR NSVFTCFQIT

PEDV SLRLFAAETI KAKEESVKSS YACATLHEW GPKELLLKWE VGRPKPPLNR NSVFTCYHIT 

TGEV SLKIFAAETV KAKEESVKSE YAYAVLKEVI GPKEIVLQWE ASKTKPPLNR NSVFTCFQIS 

BoCoV RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK NYVFTGYHFT

OC43 RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK NYVFTGYHFT 

MHV RLKLFAAETQ KATEESFKQC YASATIREIV SDRELILSWE IGKVRPPLNK NYVFTGYHFT 

AIPV SLRRFAAETV KATEELHKQQ FASAEVREVF SDRELILSWE PGKTRPPLNR NYVFTGYHFT 

SARS CoV RLKLFAAETL KATEETFKLS YGIATVREVL SDRELHLSWE VGKPRPPLNR NYVFTGYRVT

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
1145 1155 1165 1175 1185 1195 

EMCR KDSKFQIGEF IFEKVEYGSD TVTYKSTVTT KLVPGMIFVL TSHNVQPLRA PTIANQEKYS 

229E KDSKFQVGEF VFEKVDYGSD TVTYKSTATT KLVPGMLFIL TSHNVAPLRA PTMANQEKYS

PEDV KNTKFQIGEF VFEKAEYDND AVTYKTTATT KLVPGMVFVL TSHNVQPLRA PTIANQERYS 

TGEV KDTKIQLGEF VFEQSEYGSD SVYYKSTSTY KLTPGMIFVL TSHNVSPLKA PILVNQEKYN 

BoCoV KNGKTVLGEY VFDKSEL-TN GVYYRATTTY KLSVGDVFVL TSHSVANLSA PTLVPQENYS 

OC43 KNGKTVLGEY VFDKSEL-TN GVYYRATTTY KLSVGDVFVL TSHSVANLSA PTLVPQENYS 

MHV SNGKTVLGEY VFDKSEL-TN GVYYRATTTY KLSVGDVFIL TSHAVSSLSA PTLVPQENYT 

AIPV RTSKVQLGDF TFEKGEG-KD WYYKATSTA KLSVGDIFVL TSHNWSLVA PTLCPQQTFS 

SARS CoV KNSKVQIGEY TFEKGDY-GD AWYRGTTTY KLNVGDYFVL TSHTVMPLSA PTLVPQEHYV
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1205 1215 1225 1235 1245 1255 

EMCR SIYKLHPAFN VSDAYANLVP YYQLIGKQRI TTIQGPPGSG KSHCSIGLGL YYPGARIVFV 

229E TIYKLHPSFN VSDAYANLVP YYQLIGKQRI TTIQGPPGSG KSHCSIGIGV YYPGARIVFT

PEDV TIHKLHPAFN IPEAYSSLVP YYQLIGKQRI TTIQGPPGSG KSHCVIGLGL YYPGARIVFT 

TGEV TISKLYPVFN IAEAYNTLVP YYQMIGKQKF TTIQGPPGSG KSHCVIGLGL YYPQARIVYT 

BoCoV SIR-FASVYS VLETFQNNW NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV YYCTARWYT 

OC43 SIR-FASVYS VLETFQNNW NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV FYCTARWYT 

MHV SIR-FASVYS VPETFQNNVP NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV YYCTARWYT 

AIPV RFVNLRPNVM VPECFVNNIP LYHLVGKQKR TTVQGPPGSG KSHFAIGLAV YFSSARWFT 

SARS CoV RITGLYPTLN ISDEFSSNVA NYQKVGMQKY STLQGPPGTG KSHFAIGLAL YYPSARIVYT

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1265 1275 1285 1295 1305 1315

EMCR ACAHAAVDSL CAKAMTVYSI DKCTRIIPAR ARVECYSGFK PNNTSAQYIF STVNALPECN

229E ACSHAAVDSL CAKAVTAYSV DKCTRIIPAR ARVECYSGFK PNNNSAQYVF STVNALPEVN

PEDV ACSHAAVDSL CVKASTAYSN DKCSRIIPQR ARVECYDGFK SNNTSAQYLF STVNALPECN 

TGEV ACSHAAVDAL CEKAAKNFNV DRCSRIIPQR IRVDCYTGFK PNNTNAQYLF CTVNALPEAS 

BoCoV AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF TTINALPEMV 

OC43 AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF TTINALPEMV 

MHV AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVDCYDKFK VNDTTRKYVF TTINALPELV 

AIPV ACSHAAVDAL CEKAFKFLKV DDCTRIVPQR TTVDCFSKFK ANDTGKKYIF STINALPEVS 

SARS CoV ACSHAAVDAL CEKALKYLPI DKCSRIIPAR ARVECFDKFK VNSTLEQYVF CTVNALPETT 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1325 1335 1345 1355 1365 1375 

EMCR ADIWVDEVS MCTNYDLSVI NQRLSYKHIV YVGDPQQLPA PRVMITKGVM EPVDYNVVTQ 

229E ADIWVDEVS MCTNYDLSVI NQRISYKHIV YVGDPQQLPA PRVLISKGVM EPIDYNWTQ

PEDV ADIWVDEVS MCTNYDLSVI NQRISYRHW YVGDPQQLPA PRVMISRGTL EPKDYNWTQ 

TGEV CDIVVVDEVS MCTNYDLSVI NSRLSYKHIV YVGDPQQLPA PRTLINKGVL QPQDYNWTK 

BoCoV TDIWVDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL EPKYFNTVTK

OC43 TDIWVDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL EPKYFNTVTK 

MHV TDIIVVDEVS MLTNYELSVI NSRVRAKHYV YIGDPAQLPA PRVLLNKGTL EPRYFNSVTK 

AIPV CDILLVDEVS MLTNYELSFI NGKINYQYW YVGDPAQLPA PRTLLN-GSL SPKDYNVVTN 

SARS CoV ADIWFDEIS MATNYDLSVV NARLRAKHYV YIGDPAQLPA PRTLLTKGTL EPEYFNSVCR

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
1385 1395 1405 1415 1425 1435 

EMCR RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKPA SKQCFKIFFK G NVQVDN 

229E RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKEA SKQCFKIFER G SVQVDN 

PEDV RMCALKPDVF LHKCYRCPAE IVRTVSEMVY ENQFIPVHPD SKQCFKIFCK G NVQVDN

TGEV RMCTLGPDVF LHKCYRCPAE IVKTVSALVY ENKFVPVNPE SKQCFKMFVK G QVQIES 

BoCoV LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK G VTTHES 

OC43 LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK G— VTTHES 

MHV LMCCLGPDIF LGTCYRCPKE IVDTVSALVY HNKLKAKNDN SSMCFKVYYK G QTTHES 

AIPV LMVCVKPDIF LAKCYRCPKE IVDTVSTLVY DGKFIANNPE SRECFKVIVN NGNSDVGHES 
SARS CoV LMKTIGPDMF LGTCRRCPAE IVDTVSALVY DNKLKAHKDK SAQCFKMFYK G VITHDV

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1445 1455 1465 1475 1485 1495 

EMCR GSSINRKQLE IVKLFLVKNP SWSKAVFISP YNSQNYVASR FLGLQIQTVD SSQGSEYDYV 

229E GSSINRRQLD WKRFIHKNS TWSKAVFISP YNSQNYVAAR LLGLQTQTVD SAQGSEYDYV

PEDV GSSINRRQLD WRMFLAKNP RWSKAVFISP YNSQNYVASR LLGLQIQTVD SSQGSEYDYV 

TGEV NSSINNKQLE WKAFLAHNP KWRKAVFISP YNSQNYVARR LLGLQTQTVD SAQGSEYDYV 

BoCoV SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD SAQGSEYDYV

OC43 SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD SAQGSEYDYV 

MHV SSAVNMQQIY LISKFLKANP SWSNAVFISP YNSQNYVAKR VLGLQTQTVD SAQGSEYDFV 

AIPV GSAYNTTQLE FVKDFVCRNK QWREAIFISP YNAMNQRAYR MLGLNVQTVD SSQGSEYDYV 

SARS CoV SSAINRPQIG WREFLTRNP AWRKAVFISP YNSQNAVASK ILGLPTQTVD SSQGSEYDYV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1505 1515 1525 1535 1545 1555 

EMCR IYAQTSDTAH ACNVNRFNVA ITRAKKGIFC VMCDKT-LFD SLKFFEIKHA — DLHSS 

229E IFAQTSDTAH ACNANRFNVA ITRAKKGIFC IMSDRT-LFD ALKFFEITMT — DLQSE 

PEDV IYAQTSDTAH ASNVNRFNVA ITRAKKGILC IMCDRS-LFD LLKFFELKLS — DLQAN 

TGEV IYTQTSDTQH ATNVNRFNVA ITRAKVGILC IMCDRT-MYE NLDFYELKDS KIGLQAKP — 

BoCoV IYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTVD KVPQAVETRV 

OC43 IYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTLD KVPQAVETKV 

MHV IYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSSMQ-LFE SLNFSTLTLD KIN NPRL 

AIPV IFCVTADSQH ALNINRFNVA LTRAKRGILV VMRQRDELYS ALKFTELDSE TSLQG

SARS CoV IFTQTTETAH SCNVNRFNVA ITRAKIGILC IMSDRD-LYD KLQFTSLEIP RRN-VATLQA

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
1565 1575 1585 1595 1605 1615 

EMCR -QVCGLFKNC TRTPLNLPPT HAHTFLSLSD QFKTTGDLAV QIGSNN — VC TYEHVISFMG 

229E -SSCGLFKDC ARNPIDLPPS HATTYLSLSD RFKTSGDLAV QIGNNN — VC TYEHVISYMG

PEDV -EGCGLFKDC SRGDDLLPPS HANTFMSLAD NFKTDQYLAV QIGVNG — PI KYEHVISFMG 

TGEV -ETCGLFKDC SKSEQYIPPA YATTYMSLSD NFKTSDGLAV NIG-TK— DV KYANVISYMG 

BoCoV QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV TYSRLISLMG

OC43 QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV TYSRLISLMG 

MHV QCTTNLFKDC SRSYAGYHPA HAPSFLAVDD KYKVGGDLAV CLNVAD-SAV TYSRLISLMG 

AIPV TGLFKIC NKEFSGVHPA YAVTTKALAA TYKVNDELAA LVNVEAGSEI TYKHLISLLG

SARS CoV ENVTGLFKDC SKIITGLHPT QAPTHLSVDI KFKTEG-LCV DIPGIP-KDM TYRRLISMMG 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1625 1635 1645 1655 1665 1675 

EMCR FRFDISIPGS HSLFCTRDFA IRNVRGWLGM DVESAHVCGD NIGTNVPLQV GFSNGVNFW 

229E FRFDVSKPGS HSLFCTRDFA MRHVRGWLGM DVEGAHVTGD NVGTNVPLQV GFSNGVDFVA

PEDV FRFDINIPNH HTLFCTRDFA MRNVRGWLGF DVEGAHWGS NVGTNVPLQL GFSNGVDFW 

TGEV FRFEANIPGY HTLFCTRDFA MRNVRAWLGF DVEGAHVCGD NVGTNVPLQL GFSNGVDFW 

BoCoV FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL GFSTGIDFW 

OC43 FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL GFSTGIDFW 

MHV FKLDLTLDGY CKLFITRDEA IRRVRAWVGF DAEGAHATRD SIGTNFPLQL GFSTGIDFW 

AIPV FKMSVNVEGC HNMFITRDEA IRNVRGWVGF DVEATHACGT NIGTNLPFQV GFSTGADFW 

SARS CoV FKMNYQVNGY PNMFITREEA IRHVRAWIGF DVEGCHATRD AVGTNLPLQL GFSTGVNLVA

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1685 1695 1705 1715 1725 1735 

EMCR QTEGCVSTNF GDVIKPVCAK SPPGEQFRHL VPFLRKGQPW LIVRRRIVQM ISDYLSNLSD 

229E QPEGCVLTNT GSWKPVRAR APPGEQFTHI VPLLRKGQPW SVLRKRIVQM IADFLAGSSD

PEDV RPEGCWTES GDYIKPVRAR APPGEQFAHL LPLLKRGQPW DWRKRIVQM CSDYLANLSD 

TGEV QTEGCVITEK GNSIEWKAR APPGEQFAHL IPLMRKGQPW HIVRRRIVQM VCDYFDGLSD 

BoCoV EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGQRW DWRPRIVQM FADHLIDLSD 

OC43 EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGHRW DWRPRIVQM FADHLIDLSD 

MHV EATGMFAERD GYVFKKAVAR APPGEQFKHL VPLMSRGQKW DWRIRIVQM LSDHLVDLAD 

AIPV TPEGLVDTSI GNNFEPVNSK APPGEQFNHL RVLFKSAKPW HVIRPRIVQM LADNLCNVSD 

SARS CoV VPTGYVDTEN NTEFTRVNAK PPPGDQFKHL IPLMYKGLPW NWRIKIVQM LSDTLKGLSD

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1745 1755 1765 1775 1785 1795 

EMCR ILVFVLWAGS LELTTMRYFV KIGPIKYCY- CGNSATCYNS VSNEYCCFKH ALGCDYVYNP 

229E VLVFVLWAGG LELTTMRYFV KIGAVKHCQ- CGTVATCYNS VSNDYCCFKH ALGCDYVYNP 

PEDV ILIFVLWAGG LELTTMRYFV KIGPSKSCD- CGKVATCYNS ALHTYCCFKH ALGCDYLYNP 

TGEV ILIFVLWAGG LELTTMRYFV KIGRPQKCE- CGKSATCYSS SQSVYACFKH ALGCDYLYNP 

BoCoV CWLVTWAAN FELTCLRYFA KVGREISCNV STKRATAYNS RTGYYGCWRH SVTCDYLYNP

OC43 CWLVTWAAN FELTCLRYFA KVGREISCNV CTKRATVYNS RTGYYGCWRH SVTCDYLYNP 

MHV SWLVTWAAS FELTCLRYFA KVGKEWCSV CNKRATCFNS RTGYYGCWRH SYSCDYLYNP 

AIPV CWFVTWCHG LELTTLRYFV KIGKEQVCS- CGSRATTFNS HTQAYACWKH CLGFDFVYNP 

SARS CoV RWFVLWAHG FELTSMKYFV KIGPERTCCL CDKRATCFST SSDTYACWNH SVGFDYVYNP 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1805 1815 1825 1835 1845 1855 

EMCR YAFDIQQWGY VGSLSQNHHT FCNIHRNEHD ASGDAVMTRC LAVHDCFVKN VDWTVTYPFI 

229E YVIDIQQWGY VGSLSTNHHA ICNVHRNEHV ASGDAIMTRC LAVYDCFVKN VDWSITYPMI

PEDV YCIDIQQWGY KGSLSLNHHE HCNVHRNEHV ASGDAIMTRC LAIHDCFVKN VDWSITYPFI 

TGEV YCIDIQQWGY TGSLSMNHHE VCNIHRNEHV ASGDAIMTRC LAIHDCFVKR VDWSIVYPFI 

BoCoV LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN INWNVEYPII 

OC43 LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN INWNVEYPII 

MHV LIVDIQQWGY TGSLTSNHDL ICSVHKGAHV ASSDAIMTRC LAVHDCFCKS VNWSLEYPII 

AIPV LLVDIQQWGY SGNLQFNHDL HCNVHGHAHV ASVDAIMTRC LAINNAFCQD VNWDLTYPHI 

SARS CoV FMIDVQQWGF TGNLQSNHDQ HCQVHGNAHV ASCDAIMTRC LAVHECFVKR VDWSVEYPII 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1865 1875 1885 1895 1905 1915 

EMCR ANEKFINGCG RNVQGHWRA ALKLYKPSVI HDIGNPKGVR CA-VTDAKWY CYDKQPVNSN 

229E ANENAINKGG RTVQSHIMRA AIKLYNPKAI HDIGNPKGIR CA-VTDAKWY CYDKNPINSN 

PEDV GNEAVINKSG RIVQSHTMRS VLKLYNPKAI YDIGNPKGIR CA-VTDAKWF CFDKNPTNSN 

TGEV DNEEKINKAG RIVQSHVMKA ALKIFNPAAI HDVGNPKGIR CA-TTPIPWF CYDRDPINNN 

BoCoV SNELSINTSC RVLQRVMLKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK FYDAQPIVKS 

OC43 SNELSINTSC RVLQRVILKA AMLCNRYTLC YDIGNPKAIA CV— KDFDFK FYDAQPIVKS 

MHV SNEVSVNTSC RLLQRVMFRA AMLCNRYDVC YDIGNPKGLA CV — KGYDFK FYDASPWKS 

AIPV ANEDEVNSSC RYLQRMYLNA CVDALKVNVV YDIGNPKGIK CVRRGDVNFR FYDKNPIVRN 

SARS CoV GDELRVNSAC RKVQHMVVKS ALLADKFPVL HDIGNPKAIK CVPQAEVEWK FYDAQPCSDK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1925 1935 1945 1955 1965 1975 

EMCR VKLLDYD YATHG — QLD GLCLFWNCNV DMYPEFSIVC RFDTRTRSVF NLEGVNGGSL 

229E VKTLEYD YMTHG — QMD GLCLFWNCNV DMYPEFSIVC RFDTRTRSTL NLEGVNGGSL

PEDV —VKTLEYD YITHG — QFD GLCLFWNCNV DMYPEFSWC RFDTRCRSPL NLEGCNGGSL 

TGEV VRCLDYD YMVHG — QMN GLMLFWNCNV DMYPEFSIVC RFDTRTRSKL SLEGCNGGAL 

BoCoV VKTLLYF FEAHKDSFKD GLCMFWNCNV DKYPPNAVVC RFDTRVLNNL NLPGCNGGSL

OC43 VKTLLYS FEAHKDSFKD GLCMFWNCNV DKYPPNAVVC RFDTRVLNNL NLPGCNGGSL 

MHV - — VKQFVYK YEAHKDQFLD GLCMFWNCNV DKYPANAWC RFDTRVLNKL NLPGCNGGSL 

AIPV - — VKQFEYD YNQHKDKFAD GLCMFWNCNV DCYPDNSLVC RYDTRNLSVF NLPGCNGGSL 

SARS CoV AYKIEELFYS YATHHDKFTD GVCLFWNCNV DRYPANAIVC RFDTRVLSNL NLPGCDGGSL

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1985 1995 2005 2015 2025 2035 

EMCR YVNKHAFHTP AYDKRAFVKL KPMPFFYFDD SDCDVVQ EQVNYVPLR ASSCVTRCNI 

229E YVNNHAFHTP AYDKRAMAKL KPAPFFYYDD GSCEWH DQVNYVPLR ATNCITKCNI

PEDV YVNNHAFHTP AFDKRAFAKL KPMPFFFYDD TECDKLQ DSINYVPLR ASNCITKCNV 

TGEV YVNNHAFHTP AYDRRAFAKL KPMPFFYYDD SNCELVD GQPNYVPLK SNVCITKCNI 

BoCoV YVNKHAFHTK PFSRAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK SATCITRCNL 

OC43 YVNKHAFHTK PFARAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK SATCITRCNL 

MHV YVNKHAFHTS PFTRAAFENL KPMPFFYYSD TPCVYMEGME SKQVDYVPLR SATCITRCNL 

AIPV YVNKHAFYTP KFDRISFRNL KAMPFFFYDS SPCETIQ-VD GVAQDLVSLA TKDCITKCNI 

SARS CoV YVNKHAFHTP AFDKSAFTNL KQLPFFYYSD SPCESHGKQV VSDIDYVPLK SATCITRCNL 
2045 2055 2065 2075 2085 2095

EMCR GGAVCSKHAN LYQKYVBAYN TFTQAGFNIW VPHSFDVYNL WQIFIET-NL QSLENIAFNV 

229E GGAVCSKHAN LYRAYVESYN IFTQAGFNIW VPTTFDCYNL WQTFTEV-NL QGLENIAFNV 

PEDV GGAVCSKHCA MYHSYVNAYN TFTSAGFTIW VPTSFDTYNL WQTFSN — NL QGLENIAFNV 

TGEV GGAVCKKHAA LYRAYVEDYN IFMQAGFTIW CPQNFDTYML WHGFVNSKAL QSLENVAFNV 

BoCoV GGAVCLKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L QSLENWYNL 

OC43 GGAVCLKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L QSLENWYNL 

MHV GGAVCLKHAE DYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTR L QSLENWYNL 

AIPV GGAVCKKHAQ MYAEFVTSYN AAVTAGFTFW VTNKLNPYNL WKSFSA L QSIDNIAYNM 

SARS CoV GGAVCRHHAN EYRQYLDAYN MMISAGFSLW IYKQFDTYNL WNTFTR— L QSLENVAYNV 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2105 2115 2125 2135 2145 2155

EMCR VKKGCFTGVD GELPVAWND KVFVRYGDVD NLVFTNKTTL PTNVAFELFA KRKMGLTPPL

229E VNKGSFVGAD GELPVAISGD KVFVRDGNTD NLVFVNKTSL PTNIAFELFA KRKVGLTPPL

PEDV LKKGSFVGDE GELPVAVVND KVLVRDGTVD TLVFTNKTSL PTNVAFELYA KRKVGLTPPI

TGEV VKKGAFTGLK GDLPTAVIAD KIMVRDGPTD KCIFTNKTSL PTNVAFELYA KRKLGLTPPL

BoCoV VKTGHYTGQA GEMPCAIIND KWAKIDKED VVIFINNTTY PTNVAVELFA KRSIRHHPEL

OC43 VKTGHYTGQA GEMPCAIIND KWAKIDKED WIFINNTTY PTNVAVELFA KRSVRHHPEL

MHV VNAGHFDGRA GELPCAVIGE KVIAKIQNED VWFKNNTPF PTNVAVELFA KRSIRPHPEL

AIPV YKGGHYDAIA GEMPTVITGD KVFVIDQGVE KAVFVNQTTL PTSVAFELYA KRNIRTLPNN

SARS CoV VNKGHFDGHA GEAPVSIINN AVYTKVDGID VEIFENKTTL PVNVAFELWA KRNIKPVPEI



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2165 2175 2185 2195 2205 2215 

EMCR SILKNLGVVA TYKFVLWDYE AERPFTSYTK SVCKYTDFN EDV CVCFDNSIQG 

229E SILKNLGWA TYKFVLWDYE AERPLTSFTK SVCGYTDFA EDV CTCYDNSIQG 

PEDV TILRNLGVVC TSKCVIWDYE AERPLTTFTK DVCKYTDFE- GOV CTLFDNSIVG 

TGEV TILRNLGVVA TYKFVLWDYE AERPFSNFTK QVCSYTDLD SEV VTCFDNSIAG 

BoCoV KLFRNLNIDV CWKHVIWDYA RESIFCSNTY GVCMYTDLK LIDKL NVLFDGRDNG 

OC43 KLFRNLNIDV CWKHVIWDYA RESIFCSNTY GVCMYTDLK FIDKL NVLFDGRDNG 

MHV KLFRNLNIDV CWSHVLWDYA KDSVFCSSTY KVCKYTDLQ CIESL NVLFDGRDNG 

AIPV RILKGLGVDV TNGFVIWDYA NQTPLYRNTV KVCAYTDIE PNGL VVLYDDR-YG 

SARS CoV KILNNLGVDI AANTVIWDYK REAPAHVSTI GVCTMTDIAK KPTESACSSL TVLFDGRVEG 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
2225 2235 2245 2255 2265 2275 

EMCR SYERFTLTTN AVLFSTWIK N LTPIK LNFGMLNGMP VSSIKSDKGV EKLVNWYTYV 

229E SYERFTLSTN AVLFSATAVK TGGKSLPAIK LNFGMLNGNA IATVKSEDGN IKNINWFVYV 

PEDV SLERFSMTQN AVLMSLTAVK K LTGIK LTYGYLNGVP VNTHED KPFTWYIYT 

TGEV SFERFTTTRD AVLISNNAVK G LSAIK LQYGLLNDLP VSTVGN KPVTWYIYV 

BoCoV ALEAFKRSNN GVYISTTKVK S LSMIR GPPRAELNGV VVDKVGD TDCVFYFAV 

OC43 ALEAFKRSNN GVYISTTKVK S LSMIR GPPRAELNGV WDKVGD TDCVFYFAV 

MHV ALEAFKKCRD GVYINTTKIK S LSMIK GPQRADLNGV WEKVGD SDVEFWFAM 

AIPV DYQSFLAADN AVLVSTQCYK R YSYVE IPSNLLVQNG MPLKDG ANLYVYK 

SARS CoV QVDLFRNARN GVLITEGSVK G LTPSK GPAQASVNGV TLIGES VKTQFNYFK



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2285 2295 2305 2315 2325 2335

EMCR RKNG QFQDH Y DGFYTQ GRNLSDFTPR

229E RKDG KPVDH Y DGFYTQ GRNLQDFLPR

PEDV RKNG KFEDY p DGYFTQ GRTTADFSPR

TGEV RKNG -EYVEQ I DSYYTQ GRTFETFKPR

BoCoV RKEGQDVIFS QFDSLRVSSN QSPQGNLGSN -EPGNVGGND ALATSTIFTQ SRVISSFTCR

OC43 RKEGQDVIFS QFDSLGVSSN QSPQGNLGSN GKPGNVGGND ALSISTIFTQ SRVISSFTCR

MHV RRDGDDVIFS RTGSLEPSHY RSPQGNPGGN -RVGDLSGNE ALARGTIFTQ SRFLSSFAPR

AIPV RVNG AFVTL p NTINTQ GRSYETFEPR

SARS CoV KVDG IIQQL p ETYFTQ SRDLEDFKPR



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2345 2355 2365 2375 2385 2395 

EMCR SDMEYDFLNM DMGVFINKYG LEDFNFEHW YGDVSKTTLG GLHLLISQFR LSKMGVLKAD 

229E STMEEDFLNM DIGVFIQKYG LEDFNFEHW YGDVSKTTLG GLHLLISQVR LSKMGILKAE

PEDV SDMEKDFLSM DMGLFINKYG LEDYGFEHW YGDVSKTTLG GLHLLISQVR LACMGVLKID 

TGEV STMEEDFLSM DTTLFIQKYG LEDYGFEHW FGDVSKTTIG GMHLLISQVR LAKMGLFSVQ 

BoCoV TDMEKDFIAL DQDVFIQKYG LEDYAFEHIV YGNFNQKIIG GLHLLIGLYR RQQTSNLVIQ

OC43 TDMEKDFIAL DQDVFIQKYG LEDYAFEHIV YGNFNQKIIG GLHLLIGLYR RQQTSNLWQ 

MHV SEMEKDFMDL DEDVFIAKYS LQDYAFEHVV YGSFNQKIIG GLHLLIGLAR RQQKSNLVIQ 

AIPV SDIERDFLAM SEESFVERYG -KDLGLQHIL YGEVDKPQLG GLHTVIGMYR LLRANKLNAK 

SARS CoV SQMETDFLEL AMDEFIQRYK LEGYAFEHIV YGDFSHGQLG GLHLMIGLAK RSQDSPLKLE



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2405 2415 2425 2435 2445 2455 

EMCR DFVTASDTTL RCCTVTYLNE LSSKVVCTYM DLLLDDFVTI LK SLDLG VISKVHEVII 

229E EFVAASDITL KCCTVTYLND PSSKTVCTYM DLLLDDFVSV LK— SLDLT WSKVHEVII

PEDV EFVSSNDSTL KSCTVTYADN PSSKMVCTYM DLLLDDFVSI LK SLDLS WSKVHEVMV 

TGEV EFMNNSDSTL KSCCITYADD PSSKNVCTYM DILLDDFVTI IK— SLDLN VVSKWDVIV 

BoCoV EFVS-YDSSI HSYFITDEKS GGSKSVCTVI DILLDDFVAL VK— -SLNLN CVSKWNVNV 

OC43 EFVS-YDSSI HSYFITDEKS GGSKSVCTVI DILLDDFVAL VK— SLNLN CVSKWNVNV 

MHV EFVP-YDSSI HSYFITDENS GSSKSVCTVI DLLLDDFVDI VK— SLNLN CVSKWNVNV 

AIPV SVTN-SDSDV MQNYFVLSDN GSYKQVCTVV DLLLDDFLEL LRNILKEYGT NKSKWTVSI 

SARS CoV DFIP-MDSTV KNYFITDAQT GSSKCVCSVI DLLLDDFVEI IK— SQDLS VISKWKVTI
2465 2475 2485 2495 2505 2515

EMCR DNKPYRWMLW CKDNHLSTFY PQLQS-AEWK CGYAMPQIYK LQRMCLEPCN LYNYGAGIKL 

229E DNKPWRWMLW CKDNAVATFY PQLQS-AEWK CGYSMPGIYK TQRMCLEPCN LYNYGAGLKL 

PEDV DCKMWRWMLW CKDHKLQTFY PQLQA-SEWK CGYSMPSIYK IQRMCLEPCN LYNYGAGVKL 

TGEV DCKAWRWMLW CENSHIKTFY PQLQS-AEWK PGYSMPTLYK IQRMCLERCN LYKYGAQVKL 

BoCoV DFKDFQFMLW CNDEKVMTFY PRLQAASDWK PGYSMPVLYK YLNSPMERVS LWNYGKPVTL 

OC43 DFKDFQFMLW CNDEKVMTFY PRLQAASDWK PGYSMPVLYK YLNSPMERVS LWNYGKPVTL 

MHV DFKDFQFMLW CNEEKVMTFY PRLQAAADWK PGYVMPVLYK YLESPLERVN LWNYGKPITL 

AIPV DYHSINFMTW FEDGSIKTCY PQLQS — AWT CGYNMPELYK VQNCVMEPCN IPNYGVGITL 

SARS CoV DYAEISFMLW CKDGHVETFY PKLQASQAWQ PGVAMPNLYK MQRMLLEKCD LQNYGENAVI 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2525 2535 2545 2555 2565 2575 

EMCR PSGIMLNWK YTQLCQYLNS TTMCVPHNMR VLHYGAGSDK GVAPGTTVLK RWLPPD 

229E PSGIMFNVVK YTQLCQYFNS TTLCVPHNMR VLHLGAGSDY GVAPGTAVLK RWLPHD

PEDV PDGIMFNVVK YTQLCQYLNS TTMCVPHHMR VLHLGAGSDK GVAPGTAVLR RWLPLD 

TGEV PDGITTNWK YTQLCQYLNT TTLCVPHKMR VLHLGAAGAS GVAPGSTVLR RWLPDD 

BoCoV PTGCMMNVAK YTQLCQYLNT TTLAVPVNTR VLHLGAGSEK GVAPGSAVLR QWLPAGTILR 

OC43 PTGCMMNVAK YTQLCQYLNT TTLAVPVNMR VLHLGAGSEK GVAPGSAVLR QWLPAG 

MHV PTGCLMNVAK YTQLCQYLNT TTLAVPANMR VLHLGAGSDK DVAPGSAVLR QWLPAG 

AIPV PSGILMNVAK YTQLCQYLSK TTICVPHNMR VMHFGAGSDK GVAPGSTVLK QWLPEG 

SARS CoV PKGIMMNVAK YTQLCQYLNT LTLAVPYNMR VIHFGAGSDK GVAPGTAVLR QWLPTG 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2585 2595 2605 2615 2625 2635 

EMCR AIII DNDINDYVSD ADFSITGDCA TVYLEDKFDL LISDMYDG RIKFCDGE 

229E AIW DNDVVDYVSD ADFSVTGDCA TVYLEDKFDL LISDMYDG RTKAIDGE 

PEDV AIIV DNDSVDYVSD ADYSVTGDCS TLYLSDKFDL VISDMYDG KIKSCDGE 

TGEV AILV DNDLRDYVSD ADFSVTGDCT SLYIEDKFDL LVSDLYDG STKSIDGE 

BoCoV QWLPAGTILV HNDLYPFVSD SVATYFGDCI TLPFDCQWDL IISDMYD LLLDIGVH 

OC43 TILV DNDLYPFVSD SVATYFGDCI TLPFDCQWDL IISDMYDP ITKNIGEY 

MHV SILV DNDINPFVSD SVASYYGNCI TLPIACQWDL IISDMYDP LTKNIGEY 

AIPV TLLV DNDIVDYVSD AHVSVLSDCN KYNTEHKFDL VISDMYTDND SKRKHEGVIA 

SARS CoV TLLV DSDLNDFVSD ADSTLIGDCA TVHTANKWDL IISDMYDP RTKHVTKE 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2645 2655 2665 2675 2685 2695 

EMCR NVSKDGFFTY LNGVIREKLA IGGSVAIKIT EYSWNKYLYE LIQRFAFWTL FCTSVNTSSS 

229E NVSKEGFFTY INGFICEKLA IGGSIAIKVT EYSWNKKLYE LVQRFSFWTM FCTSVNTSSS

PEDV NVSKEGFFPY INGVITEKLA LGGTVAIKVT EFSWNKKLYE LIQKFEYWTM FCTSVNTSSS 

TGEV NTSKDGFFTY INGFIKEKLS LGGSVAIKIT EFSWNKDLYE LIQRFEYWTV FCTSVNTSSS 

BoCoV VVRCS YI HCHMIRDKLA LGGSVAIKIT EFSWNAELYK LMGYFAFWTV FCTNANASSS

OC43 NVSKDGFFTY ICHMIRDKLA LGGSVAIKIT EFSWNAELYK LMGYFAFWTV FCTNANASSS 

MHV NVSKDGFFTY LCHLIRDKLA LGGSVAIKIT EFSWNAELYS LMGKFAFWTI FCTNVNASSS 

AIPV NNGNDDVFIY LSSFLRNNLA LGGSFAVKVT ETSWHEVLYD IAQDCAWWTM FCTAVNASSS

SARS CoV NDSKEGFFTY LCGFIKQKLA LGGSIAVKIT EHSWNADLYK LMGHFSWWTA FVTNVNASSS

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2705 2715 2725 2735 2745 2755 

EMCR EAFLIGINYL GDFIQGPFIA GNTVHANYIF WRNSTIMSLS YNSVLDLSKF ECKHKATWV 

229E EAFWGINYL GDFAQGPFID GNIIHANYVF WRNSTVMSLS YNSVLDLSKF NCKHKATWV

PEDV EAFLIGVHYL GDFASGAVID GNTMHANYIF WRNSTIMTMS YNSVLDLSKF NCKHKATWV 

TGEV EGFLIGINYL GPYCDKAIVD GNIMHANYIF WRNSTIMALS HNSVLDTPKF KCRCNNALIV 

BoCoV EGFLIGINYL GK--PKVEID GNVMHAIICF G EIPQFGTGVL 

OC43 EGFLIGINYL CK--PKVEID GNVMHANYLF WRNSTVWNGG AYSLFDMAKF PLKLAGTAVI 

MHV EGFLIGINWL NR--TRTEID GKTMHANYLF WRNSTMWNGG AYSLFDMSKF PLKVAGTAVV 

AIPV EAFLIGVNYL GAS-EKVKVS GKTLHANYIF WRNCNYLQTS AYSIFDVAKF DLRLKATPW 

SARS CoV EAFLIGANYL GK — PKEQID GYTMHANYIF WRNTNPIQLS SYSLFDMSKF PLKLRGTAVM 

....|....| ....|....| ....|....| ....|....|
2765 2775 2785 2795 

EMCR TLKDSDVNDM VLSLIKSGRL LLRNSGRFGG FSNHLVSTK- 

229E QLKDSDINEM VLSLVRSGKL LVRGNGKCLS FSNHLVSTK-

PEDV NLKDSSISDV VLGLLKNGKL LVRNNDAICG FSNHLVNVNK 

TGEV NLKEKELNEM VIGLLRKGKL LIRNNGKLLN FGNHFVNTP- 

BoCoV IACLIWLNSR LSWLVMP 

OC43 NLRADQINDM VYSLLEKGKL LIRDTNKEVF VGDSLVNVI- 

MHV SLKPDQINDL VLSLIEKGKL LVRDTRKEVF VGDSLVNVK- 

AIPV NLKTEQKTDL VFNLIKCGKL LVRDVGNTSF TSDSFVCTM- 

SARS CoV SLKENQINDM IYSLLEKGRL IIRENNRVW SSDILVNN — 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5 15 25 35 45 55 

EMCR M FYNQVTLAVA SDSEISGFGF AIPSVAVRAY SEAAAQGFQA 

229E M ACNRVTLAVA SDSEISANGC STIAQAVRRY SEAASNGFRA

PEDV M ASNHVTLAFA NDAEISAFGF CTASEAVSYY SEAAASGFMQ 

TGEV M SSKQFKILVN EDYQVNVPSL PIR-DVLQEI KYCYRNGFEG 

OV43 MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDMIC STTAQKLETD GICPENHVMV 

BoCoV MSKINKYGLE LHWAPEFPWM FEDAEEKLDN PSSSEVDIVC STTAQKLETG GICPENHVMV 

MHV MAKMGKYGLG FKWAPEFPWM LPNASEKLGS PERSEEDGFC PSAAQEPKTK GKTLINHVRV 

AIBV MASSLKQGVS PKPRDVILVS KDI PEQLCDA LFFYTSHNPK 

SARS CoV — MESLVLGV NEKTHVQLSL PVLQVRDVLV RGFGDSVEEA LSEAREHLKN 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
65 75 85 95 105 115 

EMCR CRFVAFGLQD CVTGINDDDY VIALTG TNQLCAKILL FSDRPLNLRG 

229E CRFVSLDLQD CIVGIADOTY VMGLHG NQTLFCNIMK FSDRPFMLHG 

PEDV CRFVSLDLAD TVEGLLPEDY VMWIG TTKLSAYVDT FGSRPRNICG 

TGEV YVFVPEYCRD LVDCDRKDHY VIGVLG NGVSDLKPVL LTEPSVMLQG 

OV43 DCRRLLKQEC CVQSSLIREI VMNASPYDLE VLLQDALQSR EAVLVTTPLG MSLEACYVRG 

BoCoV DCRRLLKQEC CVQSSLIREI VMNTRPYDLE VLLQDALQSR EAVLVTPPLG MSLEACYVRG

MHV DCSRLPALEC CVQSAIIRDI FVDEDPLNVE ASTMMALQFG SAVLVKPSKR LSIQAWAKLG 

AIBV DYADAFAVRQ KFDRSLQTGK QFKFET V CGLFLLKGVD KITPGVPAKV 

SARS CoV GTCGLVELEK GVLPQLEQPY VFIKR--SDA LSTNHGHKW ELVAEMDGIQ YGRSGITLGV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
125 135 145 155 165 175 

EMCR WLIFSNSNYV LQDFDWFG- -HGAGSWFV DKYMCGFDGK PVLP--KNMW EFRDYFNDNT 

229E WLVFSNSNYL LEEFDWFGK -RGGGNVTYT DQYLCGADGK - PVMS--EDLW QFVDHFGENE 

PEDV WLLFSNCNYF LEELELTFG- -RRGGNIVPV DQYMCGADGK PVLQ— ESEW EYTDFFADSE 

TGEV FIVRANCNGV LEDFDLKIA- -RTGRGAIYV DQYMCGADGK PVIE--G DFKDYFGDED 

OV43 CNPKGWTMGL FRRRSVCNTG RCTVNKHVAY QLYMIDPAGV CLGAGQFVGW VIPLAFMPVQ 

BoCoV CNPNGWTMGL FRRRSVCNTG RCAVNKHVAY QLYMIDPAGV CFGAGQFVGW VIPLAFMPVQ

MHV VLPKTPAMGL FKRFCLCNTR ECVCDAHVAF QLFTVQPDGV CLGNGRFIGW FVPVTAIPAY 

AIBV LKATSKLADL EDIFGVSPLA RKYRELLKTA CQWSLTVEAL DVR AQ TLDEIFDPTE 

SARS CoV LVPHVGETPI AYRNVLLRK NGNKGAGG HSYGIDLKSY DLG — DELGT DPIEDYEQNW 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

185 195 205 215 225 235 

EMCR DS-IVIGGVT YQLAWDVIRK DLSYEQQNVL AIESIHYLG- TTGHTLKSGC KLINAKPPKY 

229E E--IIINGHT YVCAWLTKRK PLDYKRQNNL AIEEIEYVHG DALHTLRNGS VLEMAKEVKT

PEDV DGQLNIAGIT YVKAWIVERS DVSYASQNLT SIKSITYCS- TYEHTFLDGT AMKVARTPKI 

TGEV — IIEFEGEE YHCAWTTVRD EKPLNQQTLF TIQEIQYNL- DIPHKLPNCA TRHVAPPVKK 

OV43 SRKFIVPWVM YLRKRGEKGA YNKDHGRGGF GH-VYDFKVE DAYDQVHDEP KGKFSKKAYA 

BoCoV SRKFIAPWVM YLRKCGEKGA YIKDYKRGGF EH-VYNFKVE DAYDLVHDEP KGKFSKKAYA

MHV AKQWLQPWSI LLRKGGNKGS VTSGHFRRAV TMPVYDFNVE DACEEVHLNP KGKYSRKAYA 

AIBV ILWLQVAARI HVSSMAMRRL VGEVTAKVMD ALG SNLSALFQIV KQQIARIFQK 

SARS CoV NTKHGSGALR ELTRELNGGA VTRYVDNNFC GPDGYPLDCI KDFLARAGKS MCTLS-EQLD 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

245 255 265 275 285 295 

EMCR SSKVVLSGEW NAVYKAFGSP FITNGISLLD IIVKPVFFNA FVKCNCGSEN WSVGAWDGYL

229E SSKVVLSDAL DKLYKVFGSP VMTNGSNILE AFTKPVFISA LVQCTCGTKS WSVGDWTGFK

PEDV KKNVVLSEPL ATIYREIGSP FVDNGSDARS IIRRPVFLHA FVKCKCGSYH WTVGDWTSYV 

TGEV NSKIVLSEDY KKLYDIFGSP FMGNGDCLSK CFDTLHFIAA TLRCPCGSES SGVGDWTGFK 

OV43 LIRGYRGVKP LLYVDQYGCD YTGSLADGLE AYADKTLQEM KALFPTWSQE LLFDVIVAWH 

BoCoV LIRGYRGVKP LLYVDQYGCD YTGGLADGLE AYADKTLQEM KALFPIWSQE LPFDVTVAWH 

MHV LLKGYRGVKS ILFLDQYGCD YTGRLAKGLE DYGDCTLEEM KELFPVWCDS LDNEVVVAWH 

AIBV ALAIFENVNE LPQRIAALKM AFAKCARSIT VVVVERTLVV KEFAGTCLAS INGAVAKFFE 

SARS CoV YIESKRGVYC CRDHEHEIAW FTERSDKSYE HQTPFEIKSA KKFDTFKGEC PKFVFPLNSK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

305 315 325 335 345 355 

EMCR SSCCGTPAKK LCWPGNVVP GDVIITSTDA GCGVKYYAGL VVKHITNITG VSLWRVTAVH 

229E SSCCNVISNK LCVVPGNVKP GDAVITTQQA GAGIKYFCGM TLKFVANIEG VSVWRVIALQ

PEDV STCCGFKCKP VLVASCSAMP GSVVVTRAGA GTGVKYYNNM FLRHVADIDG LAFWRILKVQ 

TGEV TACCGLSGKV KGVTLGDIKP GDAWTSMSA GKGVKFFANC VLQYAGDVEG VSIWKVIKTF 

OV43 VVRDPRYVMR LQSAATIRSV AYVANPTEDL CDGSVVIKEP VHVYADDSII LRQYNLVDIM 

BoCoV VVRDPRYVMR LQSASTIRSV AYVANPTEDL CDGSVVIKEP VHVYADDSII LRQHNLVDIM 

MHV VDRDPRAVMR LQTLATIRSI GYVGQPTEDL VDGDVWREP AHLLAANAIV KRLPRLVETM 

AIBV ELPNGFMGSK IFTTLAFFKE AAVRVVENIP NAPRGTKGFE VVGNAKGTQV VVRGMRNDLT 

SARS CoV VKVIQPRVEK KKTEGFMGRI RSVYPVASPQ ECNNMHLSTL MKCNHCDEVS WQTCDFLKAT 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

365 375 385 395 405 415 

EMCR SDGMFVATSS YDALLHRNSL DPFCFDVNTL LSNQLRLAFL GASVTEDVKF A AST 

229E SVDCFVASST FVEEEHVNRM DTFCFNVRNS VTDECRLAML GAEMTSNVRR Q VAS

PEDV SKDDLACSGK FLEHHEEGFT DPCYFLNDSS LATKLKFDIL SGKFSDEVKQ A IIA 

TGEV TVDETVCTPG FEGELN DFIKPSSKSL VACSVKRAFI TGDIDDAVHD C IIT 

OV43 SHFYMEADTV VNAFYGVALK DCGFVMQFGY IDCEQDSCDF KGWIPGNMID G FACTTC 

BoCoV SCFYMEADAV VNAFYGVDLK DCGFVMQFGY IDCEQDLCDF KGWVPGNMID G FACTTC 

MHV LYT DSSV TEFCYKTKLC DCGFITQFGY VDCCGDACDF RGWVPGNMMD G FLCPGC 

AIBV LLOQKAOIPV EPEGWS AILDGHLC YVFRSGDRFY AAPLSGNFAL S 

SARS CoV CEHCGTENLV IEGPTTCGYL PTNAVVKMPC PACQDPEIGP EHSVADYHNH SNIETRLRKG

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

425 435 445 455 465 475

EMCR GVIDISAGMF GLYDDILTNN KPWFVRKASG LFDAIWDAFV AAIKLVPTTT GGLVRFVKSI 

229E GVIDISTGWF DVYDDIFAES KPWFVRKAED IFGPCWSALA SALKQLKVTT GELVRFVKSI

PEDV GHVVVGSALV OIVDDALG — QPWFIRKLGD LASAPWEQLK AWRGLGLLS DEWLFGKRL 

TGEV GKLDLSTNLF GNVGLLFKK- TPWFVQKCGA LFVDAWKWE ELCGSLTLTY KQIYEWASL 

OV43 GHVYEVGDLI AQSSGVLPVN PVLHTKSAAG YGGFGCKDSF TLYGQTWYF GGCVYWSPAR 

BoCoV GHVYETGDLL AQSSGVLPVN PVLHTKSAAG YGGFGCKDSF TLYGQTWYF GGCVYWSPAR 

MHV SKSYMPWELE AQSSGVIPKG GVLFTQSTDT VN RESF KLYGHAWPF GSAVYWSPYP 

AIBV -DVHCCERVV CLSDGVTP EIN--DGL ILAAIYSSFS VSELVTALKK GEPFKFLGHK 

SARS CoV GRTRCFGGCV FAYVGCYNKR AYWVPRASAD IG SGHT GITGDNVETL NEDLLEILSR

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

485 495 505 515 525 535 

EMCR ASTVLTVSNG VIIMCADVPD AFQPVYRTFT QAICAAFDFS LDVFKIG 

229E CNSAVAWGG TIQILASVPE KFLNAFDVFV TAIQTVFDCA VETCTIA 

PEDV SCATLSIVNG VFEFLADVPE KLAAAVTVFV NFLNEFFESA CDCLKVG 

TGEV CTSAFTIVNY KPTFWPD-N RVKDLVDKCV KVLVKAFDVF TQIITIAG 

OV43 NIWIPILKSS VKSYDSLVYT GVLGCKAIVK ETNLICKALY LDYVQHKCGN LHQRELLGVS 

BoCoV NIWIPILKSS VKSYDGLVYT GWGCKAIVK ETNLICKALY LDYVQHKCGN LHQRELLGVS 

MHV GMWLPVIWSS VKSYADLTYT GWGCKAIVQ ETDAICRSLY MDYVQHKCGN LEQRAILGLD 

AIBV FVYAKDA AVSFTLAKAA TIADVLRLFQ SARVIAEDVW SSFTEKS 

SARS CoV ERVNINI VGDFHLNEEV AULAS FSAS TSAFIDTIKS LDYKSFKT-I VESCGNYKVT

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
545 555 565 575 585 595 

EMCR DVKFKR LGDYVLTENA LVRLTTEVVR GVRD A 

229E GKAFDK VFDYVLLDNA LVKLVTTKLK GVRE R 

PEDV GKTFNK VGSYVLFDNA LVKLVKAKAR GPRQ A 

TGEV — IEAKCFVL GAKYLLFNNA LVK1VSVKIL GKKQ K 

OV43 DVWHKQLLLN RGVYKPLLEN IDYFNMRRAK FSLETFT VCADGFMPFL LDDLVPRAYY 

BoCoV DVWHKQLLLN RGVYKPLLEN IDYFNMRRAK FSLETFT— VCADGFMPFL LDDLVPRAYY 

MHV DVYHRQLLVN RGDYSLLLEN VDLFVKRRAE FACK-FA— - TCGDGLVPLL LDGLVPRSYY 

AIBV FEFWKLAYGK VRNLEEFVKT YVCK 

SARS CoV KGKPVKGAWN IGQQRSVLTP LCGFPSQAAG VIRSIFARTL DAANHSIPDL QRAAVTILDG

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

605 615 625 635 645 655 

EMCR -RIKKAMFTK WVGPTTEVK FSVIELATVN LRLVDCAPW CPKGKIWIA GQAFFYSGGF 

229E -GLNKVKYAT VWGSTEEVK SSRVERSTAV LTIANNYSKL FDEGYTWIG DVAYFVSDGY 

PEDV -GICEVRYTS LVVGSTTKVV SKRVENANVN LVWDEDVTL NTTGRTWVD GLAFFESDGF 

TGEV -GLECAFFAT SLVGATVNVT PKRTETATIS LNKVDDWAP G-EGYIVIVG DMAFYKSGEY 

OV43 LAVSGQAFCD YADKLCHAVV SKSKELLDVS LDSLGAAIHY LNSKIVDLAQ HFSDFGTSFV 

BoCoV LAVSGQAFCD YAGKICHAVV SKSKELLDVS VDSLGAAIHY LNSKIVDLAQ HFSDFGTSFV

MHV LIKSGQAFTS MMVNFSHEVT DMCMDMALLF MHDVKVATKY VKKVTGKLAV RFKALGVAW 

AIBV AQMSIVILAA VLGEDIWHLV SQVIYKLGVL FTKVVDFCDK HWKGFCVQLK RAKLIVTETF

SARS CoV ISEQSLRLVD AMVYTSDLLT NSVIIMAYVT GGLVQQTSQW LSNLLGTTVE KLRPIFEWIE

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

665 675 685 695 705 715 

EMCR YRFMVDSTTV LNDPVFTGEL FYTIKFSGFK LDGFN HQFVNAS SATDAIIAVE 

229E FRLMASPNSV LTTAVYKPLF AFNVNVMGTR PE KFPTTV TCENLESAVL 

PEDV YRHLADADVV IEHPVYKSAC ELKPVFECDP IP — D FPLPVAA SVAELCVQTD 

TGEV YFMMSSPNFV LTNNVFKAVK VPSYDIVYDV DNDTKSKMIA KLGSSFEYDG DIDAAIVKVN 

OV43 SKIVHFFKTF TTSTALAFAW VLFHVLHGAY IWESDIYFV KN-IPRYASA VAQAFQSVAK 

BoCoV SKIVHFFKTF TTSTALAFAW VLFHVLHGAY IVVESDIYFG KN-IPRYASA VAQAFRSGAK 

MHV RKITEWFDLA VDTAASAAGW LCYQLVNGLF AVANGGITFL SD-VPELVKN FVDKFKVFFK 

AIBV CVLKGVAQHC FQLLLDAIHS LYKSFKKCAL GRIHG DLLFWKGG VHKIVQDGDE 

SARS CoV AKLSAGVEFL KD AWE ILKFLITGVF DIVKGQIQVA SDNIKDCVKC FIDWNKALE

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
725 735 745 755 765 775 

EMCR LLLSDFKTAV FVYTCVVDGC SVIVRRDAT- FATHVCFKDC YSIWEQFCID NCGE 

229E FVNDKITEFQ LDYSIDVIDN EIIVKPNIS- LCVPLYVRDY VDKWDDFCRQ YSNE 

PEDV LLLKNYNTPY KTYSCWRGD KCCITCTLQ- FKAPSYVEDA VN-FVDLCTK NIGT 

TGEV ELLIEFRQQS LCFRAFKDDK SIFVEAYFKK YKMPACLAKH IG-LWNIIKK DSCK 

OV43 WLDSLRVTF IDGLSCFKIG RRRICLSGRK IYEVERG-LL HSSQLPLDVY DLTMPSQVQK 

BoCoV VGLDSLRVTF IDGLSCFKIG RRRICLSGSK IYEVERG-LL HSSQLPLDVY DLTMPSQVQK 

MHV VLIDSMSVSV LSGLTWKTA SNRVCLAGCK VYBVVQK-RL SAYVMPVGCN EATC 

AIBV IWFDAIDSVD VEDLGWQEK SIDFEVCDDV TLPENQPGHM VQIEDDGKNY MFFR 

SARS CoV MCIDQVTIAG AK-LRSLNLG EVFIAQSKGL YRQCIRGKEQ LQLLMPLKAP KEVT
....|....| ....|....|
785 795 

EMCR PW FLTDYNAILQ 

229E SW FEDDYRAFIS 

PEDV AG FHEFYITAHE 

TGEV RG FLNLFNHLNE 

OV43 AKQKPIYLKG SGSDFSLADS 

BoCoV TKQKGIYLKG SGSDFSLADS 

MHV LVG EIEPAWEDD 

AIBV FKK DENIYYTPMS

SARS CoV FLEG DSHDTVLTSE 



....). ...| 

805 
SNNPQCAIVQ 
VLDITDAAVK 
QQDLQGFLTT 
LEDIKETNIQ 
WEWTTSLT 
VVEWTTSLT 
VVDWKAPLT 
QLGAINWCK 
EWLKNGELE 



...I • • ♦ • I I 

815 825 

ASESK — VLLERFLP 

AAESK A FV DTI VP 

CCTMSG F-ECFMPTIP 

AIKN 1- 

PCG YS EPPKVADKIC 

PCG YS EPPKVADKIC 

YQG CC KPPTSFEKIC 

AGG KTVT 

ALETPVDSFT NGAIVGTPVC 



I | 

835 
KCPEILLSID 
PCPSILKVID 
QCPAVLEEID 
LCPDPLLDLD 
IVDNVYMAKA 
IVDNVYMAKA 
VVDKLYMAKC 
FGETTVQEIP 
VNGLMLLEIK 



EMCR 

229E 

PEDV . 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



I I 

345 
DGHLWNLFVE 
GGKIWNGVIK 
GGSIWRSFIT 
YGAIWYNCMP 
GDKYYPVWD 
GDKYYPVWD 
GDQFYPVWD 
PPDWPIKVS 
DKEQYCALSP 



..I.. 
855 



■ I 



....|. ...| 

865 
-FNFVTDWLK 
-VNSVRDWLK 
-LNTMWDFCK 
-CSDP-SVLG 



-DHVGLLDQA WRVPCAG — R 
-GHVGLLDQA WRVPCAG— R 
NDTIGVLDQC WRFPCAG — K 

IECCG — E 

— GLLATNNV FRLKGGAPIK 



I i 

875 
TLKLTLTSNG 
SLKLNLTQQG 
RLKVSFGLDG 
SVQLLIGNG- 
RVTFKEQPTV 
CVTFKEQPTV 
KVEFNDKPKV 
PWNTIFKKAY 
GVTFG-EDTV 



....!....! 

885 
LLGNCAKRFR 
LLGTCAKRFK 
IWTVARKFK 
-VKWCDGCK 
KEIISMPKII 
NEIASTPKTI 
KEIPST-RKI 
KEPIEVDTDL 
WEVQGY-KNV 



....|....|

895 
RVLVKLLDVY 
RWLGILLEAY 
RLGALLAEMY 
GFANQLSKGY 
KVFYELDNDF 
KVFYELDKDF 
KINFALDATF 
TVEQLLSVIY 
RITFELDERV 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV 

MHV 

AIBV 

SARS CoV 



....|....|

905 
NGFLETVCSV 
NAFLDTWST 
NTYLSTWEN 
NKLCNAARND 
NTILNTACGV 
NTILNTACGE 
DSVLSKACSE 
EKMCDDLKLF 
DKVLNEKCSV 



915 
VHTAGVCIKY 
VKIGGLTFKT 
LVLAGVSFKY 
IEIGGIPFST 
FEVDDTVDME 
FEVDDTVDME 
FEVDKDVTLD 
PEAPEPPPFE 
YTVESGTEVT 



....|....|

925 
YAVNVP-YVV 
YAFDKP-YIV 
ASLNSLT-
ASLNSLT- 
ASLKSLT- 
GIKS 



....|....|
2065 

KSTW 

KNSVA 

CSKR 

AKVE 

YFNRP 

YFNRP 

YFNRP 

W 



TTKTTFKPNT WCLRCLWSTK 



2075 2085 2095 

EVKSAWCAS VLKDGCDVG 

SINSAIVCAS VKRDGVQVG 

WTAPWNAS VLKLGVEDG- 

KFVGPWAAP LAIHGTDE 

SLVDDNKFDV LKVDDVD 

LLVDENKFDV LKVDDVD 

SWCENKFNV LPVDVSEPTD KGPVPAAVLV 

DFRSKDGFIY KLTPDTD 

PVDTSNSFEV LAVEDTQGMD N 
2105 2115 2125 2135 2145 2155

EMCR FCPHRH KLRSRVK FVNGRWIT NVGEPIISQP 

229E YCVHGI KYYSRVR SVRGRAI IV SVEQLEPCAQ

PEDV LCPHGL NYIGKW WKGTTIW NVGKPVVAPS 

TGEV TCVHGV SVNVKVT QIKGTVAIT SLIGPIIG— 

OV43 DGGDSS ESGAKE TKEINIIKLS GVKKPFKVED 

BoCoV DGGDIS ESDAKE PKEINIIKLS GVKKPFKVED 

MHV TGALSGAATA PGTAKEQKVC ASDSVVDQW SGFLSDLSGA TVDVKEVKLN GVKKPIKVED 

AIBV ENSKAPVY YPVLDAISLK 

SARS CoV LAC ESQ QPTSEEWEN PTIQKEVIE CDVKTTEWG 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV

MHV 

AIBV 

SARS CoV 



....|....|

2165 
SKLLNGIAYT 
SRLLSGVAYT 
HLFLKGVSYT 
-EVLEATGYI 
SVIVNDDTSE 
SVIVNDDTSE 
SWVNDPTSE 
AIWVEGNANF 
NVILKPSDEG 
2175
TFS— GSFDN 
AFS — GPVDK 
TFLDNGNGW 
CYS — GSNRN 
TKYVKSLSIV 
IKYVKSLSIV 
TKVVKSLSIV 

WG HP 

VKVTQELGHE 



....|....|

2185 
GHYWYDAAN 
GHYTVYDTAK 
GHYTVFDHGT 
GHYTYYDNRN 
DVYDMWLTGC 
DVYDMWLTGC 
DVYDMFLTGC 
NYYSKSLHIP 
DLMAAYVENT 



....|....|

2195 
NAVYDGARLF 
KSMYDGDRFV 
GMVHDGDAFV 
GLWDAEKAY 
KYWRTANAL 
RCWRTANAL 
RYWWMANEL 
TFWENAENFV 
SITIKKPNEL 



....|....| 

2205 
ASDLSTLAVT 
KHDLSLLSVT 
PGDLNVSPVT 
HFNRDLLQVT 
SRAVNVPTIR 
SRAVNVPTIR 
SRLVNSPTVR 
KMGDKIGGVT 
SLALGLKTIA 



....|....|

2215 
AIVWGGCVT 
SWMVGGYVA 
NVWSEQTAV 
TAIASNFWK 
KFIKFGMTLV 
KFIKFGMTLV 
EYVKWGMTKI 
MGLWRAEHLN 
THGIAAINSV 
2225 2235 2245 2255 2265 2275

EMCR S NVPP IVSEKISVMD KLDTG AQ KFFQFGDFVM NNIVLFLTWL 

229E PV NTVKPKPVIN QLDEK AQ KFFDFGDFLI HNFVIFFTWL 

PEDV V- --IKDP VKKAELDATK LLDTMNYASE RFFSFGDFMS RNLITVFLYI 

TGEV KPQAEERPKN CAFNKVAASP KIVQEQKLLA IESGANYALT EFGRYADMFF MAGDKILRLL 

OV43 SIP 1 DLL NLREIKPAVN WKAVRNKIS VCFNFIKWLF VLLFGWIKIS 

BoCoV SIP 1 DLL NLREIKPVFN WKAVRNKIS ACFNFIKWLF VLLFGWIKIS 

MHV VIP AKLV LLRDEKQEFV APKWKAKVI ACYSAVKWFF LYCFSWIKFN 

AIBV KPN LERI FNIAKKAIVG SSWTTQCGK LIGKAATFIA DKVGGGWRN 

SARS CoV PWS KILA YVKPFLG QAAITTSN CAKRLAQRVF NNYMPYVFTL 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV

MHV 

AIBV 

SARS CoV 
2285
LSMFSLLRTS 
LSMFTLCKTA 
LSILGLCFRA 
LEVFKYLLVL 
ADNKVIYTTE 
ADNKVIYTTE 
TDNKVIYTTE 
ITDSIKGLCG 
LFQLCTFTKS 



2295 
IMKHDIKVIA 
VTTGDVKIMA 
FRKRDVKVLA 
FMCLRSTKMP 
IASKLTCKLV 
VASKLTCKLV 
VASKLTFNLC 
ITRGHFERKM 
TNSRIRASLP 



....|....| 

2305 
KAPKRTGVIL 
KAPQRTGVVL 
GVPQRTGIIL 
KVKVKP-PLA 
ALAFKNAFLT 
ALAFKNAFLT 
CLAFKNALQT 
SPQFLKTLMF 
TTIAKNSVKS 



....|....|

2315 
TRSFKYNIRS 
KRSLKYNLKA 
RKSMRYNAKA 
FKDFGAKVRT 
FKWSMVARGA 
FKWSWARGA 
FNWNWSRGF 
FLFYFLKASV 
VAKLCLDAGI 



....|....|

2325 
ALFWKQKWC 
SAAVLKSKWW 
LGVFFKLKLY 
LNYMRQLNKP 
CIIATIFLLW 
CIIATIFLLW 
FLVATVFLLW 
KSWASYKTV 
NYVKSPKFSK 



....|....|

2335 
VIVTLFKFLL 
LLAKFTKLLL 
WFKVLGKFSL 
SVWRYAKLVL 
FNFIYANVIF 
FNFIYANVIF 
FNFLYANVIL 
LCKWLATLL 
LFTIAMWLLL 



EMCR 

229E 

PEDV 

TGEV 

OV43 

BoCoV

MHV 

AIBV 

SARS CoV



....|....|

2345 
LLYAIYALVF 
LIYTLYSVVL 
GIYALYALLF 
LLIAIYNFFY 
SDFYLPKIGF 
SDFYLPKIGF 
SDFYLPNIGF 
IVWFVYTSNP 
LSICLGSLIC 



....|....| 

2355 
MIVQFSPFNS 
LCVRFGPFN- 
MTIRFTPIGS 
LFVSIPWHK 
LPTFVGKIAQ 
LPTFVGKIVQ 
FPTFVGQIVA 
VMFTGIRVLD 
VTAAFGVLLS 



....|....|

2365 
LLCGDIVSGY 
-FCSETVNGY 
PVCDDWAGY 
LTCNGAVQAY 
WIKNTFSLVT 
WIKNTFSLVT 
WVKTTFGIFT 
FLFEGSLCGP 
NFGAPSYCNG 



....|....|
2375 

EKSTFN 

AKSNFV 

ANSSFD-— 

KNSSFI 

ICDLYSMQDV 
ICDLYSIQDV 
LCDLYQVSDV 
YKDYGK — DS 
VRELYLNSSN 



....|....|



2385 
— KDIYCGNS 
— KDDYCDGS 
— KNEYCN-S 
— KSAVCGNS 
GFKNQYCNGS 
GFKNQYCNGS 
GYRSSFCNGS 
FDVLRYCADD 
VTTMDFCEGS 



....|....|

2395 
MVCKMCLFSY 
LGCKMCLFGY 
VICKVCLYGY 
ILCKACLASY 
IACQFCLAGF 
IACQFCLAGF 
MVCELCFSGF 
FICRVCLHDK 
FPCSICLSGL 
2405 2415 2425 2435 2445 2455

EMCR QEFNDLDHTS LVWKHIR D — P ILISLQPFV ILVILLIFGN MYLRFGLLYF 

229E QELSQFSHLD WWKHIT-- D--P LFSNMQPFI VMVLLLIFGD NYLRCFLLYF 

PEDV QELSDFSHTQ VVWQHLR — D--P LIGNVMPFF YLAFLAIFGG VYVKAITLYF 

TGEV DELADFQHLQ VTWDFKS D— P LWNRLVQLS YFAFLAVFGN NYVRCFLMYF 

OV43 DMLDNYKAID VVQYEADRRA FVDYTGVLKI VIELIVSYAL YTAWFYPLFA LISIQILTTW 

BoCoV DMLDNYKAID VVQYEADRRA FVDYTGVLKI VIELIVSYAL YTAWFYPLFA LISIQILTTW 

MHV DMLDNYDAIN VVQHWDRRV SFDYISLFKL WELVIGYSL YTVCFYPLFG LIGMQLLTTW 

AIBV DSLHLYKHAY SVEQVYKDAA SG FIFNWNWL YLVFLILFVK PVAGFVIICY 

SARS CoV DSLDSYPALE TIQVTIS— S YKLDLTILGL AAEWVLAYML FTKFFYLLGL SAIMQVFFGY
2465 2475 2485 2495 2505 2515

EMCR VAQFISTFG- -SFLGFHQKQ WFLHFVPFDV LCNEFLATFI VCKIVLFVRH IIVGCNNADC 

229E VAQMISTVG- -VFLGYKETN WFLHFIPFDV ICDELLVTVI VIKVISFVRH VLFGCENPDC 

PEDV IFQYLNSLG- -VFLGLQQSI WFLQLVPFDV FGDEIWFFI VTRVLMFIKH VCLGCDKASC 

TGEV VSQYLNLWL- -SYFGYVEYS WFLHWNFES ISAEFVIWI WKAVLALKH IVFACSNPSC 

OV43 LPELFMLST- -LHWSFRLLV ALANMLPAHV FMRFYIIIAS FIKLFSLFRH VAYGCSKSGC 

BoCoV LPELLMLST- -LHWSVRLLV SLANMLPAHV FMRFYIIIAS FIKLFSLFRH VAYGCSKSGC 

MHV LPEFFMLET- -MHWSARFFV FVANMLPAFT LLRFYIWTA MYKIFCLCRH VMYGCSRPGC 

AIBV CVKYLVLNST VLQTGVCFLD WFVQTVFSHF NFMGAGFYFW LFYKIYIQVH HILYCKDVTC 

SARS CoV FASHFISN SWLMWFII SIVQMAPVSA MVRMYIFFAS FYYIWKSYVH IMDGCTSSTC 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2525 2535 2545 2555 2565 2575 

EMCR VACSKSARLK RVPLQTIING MHKSFYVNAN GGTCFCNKHN FFCVNCDSFG PGNTFINGDI 

229E IACSKSARLK RFPVNTIVNG VQRSFYVKAN GGSKFCKKHR FFCVDCDSYG YGSTFITPEV

PEDV VACSKSARLK RVPVQTIFQG TSKSFYVHAN GGSKFCKKHN FFCLNCDSYG PGCTFINDVI 

TGEV KTCSRTARQT RIPIQVWNG SMKTVYVHAN GTGKFCKKHN FYCKNCDSYG FENTFICDEI 

OV43 LFCYKRNRSL RVKCSTIVGG MIRYYDVMAN GGTGFCSKHQ WNCIDCDSYK PGNTFITVEA 

BoCoV LFCYKRNRSL RVKCSTIVGG MIRYYDVMAN GGTGFCSKHQ WNCIDCDSYK PGNTFITVEA

MHV LFCYKRNRSV RVKCSTWGG TLRYYDVMAN GGTGFCAKHQ WNCLNCSAFG PGNTFITHEA 

AIBV EVCKRVARSN RQEVSWVGG RKQIVHVYTN SGYNFCKRHN WYCRNCDDYG HQNTFMSPEV 

SARS CoV MMCYKRNRAT RVECTTIVNG MKRSFYVYAN GGRGFCKTHN WNCLNCDTFC TGSTFISDEV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2585 2595 2605 2615 2625 2635 

EMCR ARELGNWKT AVQPTAPAYV IIDKVDFVNG FYRLYSGDTF WRYDFDITES KYSCKE 

229E SRELGNITKT NVQPTGPAYV MIDKVEFENG FYRLYSCETF WRYNFDITES KYSCKE 

PEDV ATEVGNWKL NVQPTGPATI LIDKVEFSNG FYYLYSGDTF WKYNFDITDS KYTCKE 

TGEV VRDLSNSVKQ TVYATDRSHQ EVTKVECSDG FYRFYVGDEF TSYDYDVKHK KYSSQE 

OV43 ALDLSKELKR PIQPTDVAYH TVTDVKQVGC SMRLFYDRDG QRTYDDVNAS LFVDYSNLLH 

BoCoV ALDLSKELKR PIQPTDVAYH TVTDVKQVGC YMRLFYDRDG QRTYDDVNAS LFVDYSNLLH 

MHV AADLSKELKR PVNPTDSAYY LVTEVKQVGC SMRLFYERDG QRVYDDVSAS LFVDMNGLLH 

AIBV AGELSEKLKR HVKPTAYAYH WDEACLVDD FVNLKYKAAT PGKDSASSAV KCFSVTDFLK 

SARS CoV ARDLSLQFKR PINPTDQSSY IVDSVAVKNG ALHLYFDKAG QKTYERHPLS HFVNLDNLRA

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2645 2655 2665 2675 2685 2695 

EMCR -VLKNCNVLE NFIVYNN SGSNI TQIKNACVYF SQLLCEPIKL VNSELLSTLS 

229E -VFKNCNVLD DFIVFNN NGTNV TQVKNASVYF SQLLCRPIKL VDSELLSTLS 

PEDV -ALKNCSIIT DFIVFNN NGSNV NQVKNACVYF SQMLCKPVKL VDSALLASLS 

TGEV -VLKSMLLLD DFIVYSP SGSAL ANVRNACVYF SQLIGKPIKI VNSDLLEDLS 

OV43 SKVKSVPNMH WWEN DADK ANFLNAAVFY AQSLFRPILM VDKNLITTAN 

BoCoV SKVKSVPNMH WWEN DADK ANFLNAAVFY AQSLFRPILM VDKILITTAN

MHV SKVKGVPETH WWEN EADK AGFLNAAVFY AQSLYRPMLL VEKKLITTAN 

AIBV KAVFLKEALK CEQISNDGFI VCNTQSAHAL EEAKNAAIYY AQYLCKPILI LDQALYEQLV 
SARS CoV NNTKGSLPIN VIVFDGK SKCDE SASKSASVYY SQLMCQPILL LDQVLVSDVG 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2705 2715 2725 2735 2745 2755 

EMCR — VDFNGVLH KAYVDVLCNS FFKELTANMS MAECKATLGL T 

229E —VDFNGVLH KAYIDVLRNS FGKDLNANMS LAECKRALGL S 

PEDV — VDFGASLH SAFVSVLSNS FGKDLSSCND MQDCKSTLGF DD 

TGEV — VDFKGALF NAKKNVIKNS FNVDVSECKN LDECYRACNL N 

OV43 TGTSVTETMF DVYVDTFLSM FDVDKKSLNA LIATAHSSIK QGTQIYKVLD TFLSCARKSC 

BoCoV TGTSVTETMF DVYVDTFLSM FDVDKKSLNA LIATAHSSIK QGTQICKVLD TFLSCARKSC 

MHV TGLSVSQTMF DLYVDSLLGV LDVDRKSLTS FVNAAHNSLK EGVQLEQVMD TFIGCARRKC 

AIBV V-EPVSKSVI DKVCSILSSI ISVDTAALNY KAGTLRDALL S 

SARS CoV DSTEVSVKMF DAYVDTFSAT FSVPMEKLKA LVATAHSELA KGVALDGVLS TFVSAARQG- 
2765 2775 2785 2795 2805 2815

EMCR VSDDD FVSAVANAHR YDVLLSDLSF NNFFISYAKP EDK-LSVYDI ACCMRAGSKV 

229E ISDHE FTSAISNAHR CDVLLSDLSF NNFVSSYAKP EEK-LSAYDL ACCMRAGAKV

PEDV VPLDT FNAAVAEAHR YDVLLTDMSF NNFTTSYAKP EEK-FPVHDI ATCMRVGAKI 

TGEV VSFST FEMAVNNAHR FGILITDRSF NNFWPSKVKP GSSGVSAMDI GKCMTSDAKI 

OV43 SIDSDVDTKC LADSVMSAVS AGLELTDESC NNLVPTYLKS DN — IVAADL GVLIQNSAKH 

BoCoV SIDSDVDTKC LADSVMSAVS AGLELTDESC NNLVPTYLKG DN— IVAADL GVLIQNSAKH 

MHV AIDS DVETKS ITKSIMSAVN AGVDFTDESC NNLVPTYVKS DT— IVAADL GVLIQNNAKH 

AIBV ITKDEE AVDMAIFCHN HDVDYTGDGF TNVIPSYGID TG-KLTPRDR G PL IN ADAS I 

SARS CoV VVDTDVDTKD VIECLKLSHH SDLEVTGDSC NNFMLTYNKV EN— MTPRDL GACIDCNARH 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2825 2835 2845 2855 2865 2875 

EMCR VNHNVLIKES IPIVWGVKDF NTLSQEGKKY LVKTTKAKGL TFLLTFNDNQ AITQVP 

229E VNANVLTKDQ TPIVWHAKDF NSLSAEGRKY IVKTSKAKGL TFLLTINENQ AVTQIP 

PEDV VNHNVLVKDS IPVVWLVRDF IALSEETRKY IIRTTKVKGI TFMLTFNDCR MHTTIP 

TGEV VNAKVLTQRG KSWWLSQDF AALSSTAQKV LVKTFVEEGV NFSLTFNAVG SDDDLPYERF 

OV43 VQGNVAKIAG VSCIWSVDAF NQFSSDFQHK LKKACCKTGL KLKLTYNKQM ANVSVLT— 

BoCoV VQGNVAKIAG VSCIWSVDAF NQLSSDFQHK LKKACCKTGL KLELTYNKQM ANVSVLT—

MHV VQANVAKAAN VACIWSVDAF NQLSADLQHR LRKACSKTGL KIKLTYNKQE ANVPILT 

AIBV ANLRVKN— A PPWWKFSEL IKLSDSCLKY LISATVKSGV RFFITKSGAK QVIACHT 

SARS CoV INAQVAKSHN VSLIWNVKDY MSLSEQLRKQ IRSAAKKNNI PFRLTCATTR QWNVIT— 
2885 2895 2905 2915 2925 2935

EMCR ATSIVAKQGA G FKRTYNFL WYVCLFWAL FIGVSFID YTTTVTS 

229E ATSIVAKQGA GD AGHSLTWL WLLCGLVCLI QFYLCFFMPY --FMYDIVSS

PEDV TVCIANKKGA GLPS FSKVKKFF WFLCLFIVAA FFALSFLD FSTQVSS 

TGEV TESVSPKSGS G FFDVITQL KQIVILVFVF IFICGLCSVY SVATQSYIES 

OV43 — TPFSLKGG A— V FSYFVYVC FVLSLVCFIG LWCLMPT YTVHKSDFQL 

BoCoV — TPFSLKGG A--V FSYFVYVC FVLSLVCFIG LWCLMPT YTVHKSDFQL 

MHV —TPFSLKGG A--V FSKVLQWL FWNLICFIV LWALMPT YAVHKSDMQL 

AIBV QKLLVEKKAG GIVSGTFKCF KSYFKWLLIF YILFTACCSG YYYMEVSKSF VHPMYDVNST 

SARS CoV — TKISLKGG K — I VSTCFKLM LKATLLCVLA ALVCYIVMPV HTLSIHDGYT 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

2945 2955 2965 2975 2985 2995 

EMCR FHGYDFKYIE NGQLKVFEAP LHCVRNVFDN FNQWHEAKFG WTTNSDKCP IWG -VS 

229E FEGYDFKYIE NGQLKNFEAP LKCVRNVFEN FEDWHYAKFG FTPLNKQSCP IWG VS

PEDV DSDYDFKYIE SGQLKTFDNP LSCVHNVFIN FDQWHDAKFG FTPVNNPSCP IWG VS 

TGEV AEGYDYMVIK NGIVQPFDDT I SCVHNTYKG FGDWFKAKYG FTPTFGKSCP IWGT-VFDL 

OV43 PVYASYKVLD NGVIRDVSVE DVCFANKFEQ FDQWYESTFG LSYYSNSMAC PIWA-VIDQ 

BoCoV PVYASYKVLD NGVIRDVSVE DVCFANKFEQ FDQWYESTFG LSYYSNSMAC PIWA-WDQ

MHV PLYASFKVID NGVLRDVTVT DACFANKFIQ FDQWYESTFG LVYYRNSRAC PVVVA-VIDQ 

AIBV LHVEGFKVID KGVLREIVPE DTCFSNKFVN FDAFWGRPYD NSRNCPIVTA VIDGDGTVAT 

SARS CoV NEIIGYKAIQ DGVTRDIIST DDCFANKHAG FDAWFSQRGG S — YKNDKSC PWAA-IITR 
3005 3015 3025 3035 3045 3055

EMCR ERINWPGVP TNVYLVG KTLVFTLQA AFGNTGVCYD FDGVTTS DKCIFNSA 

229E EIVNTVAGIP SNVYLVG KTLIFTLQA AFGNAGVCYD IFGVTTP EKCIFTSA 

PEDV OEARTVPGIP AGVYLAG KTLVFAINT IFGTSGLCFD ASGVADK GACIFNSA 

TGEV ENMRPIPDVP AYVSIVG RSLVFAINA AFGVTNMCYD HTGNAVSKDS YFDTCVFNTA 

OV43 DFGSTVFNVP TKVLRYG YHVLHFITHA LSADGVQCYT PHSQISYSNF YASGCVLSSA 

BoCoV DFGSTVFNVP TKVLRYG YHVLHFITHA LSADGVQCYT PHSQISYSNF YASGCVLSSA

MHV DIGYTLFNVP TKVLRYG— FHVLHFITHA FATDSVQCYT PHMQIPYDNF YASGCVLSSL 

AIBV GVPGFVSWVM DGVMFIHMTQ TERKPWYIPT WFNREIVGYT QDSIITEGSF YTSIALFSAR 

SARS CoV EIGFIVPGLP GTVLRAIN — GDFLHFLPRV FSAVGNICYT PSKLIEYSDF ATSACVLAAE 
3065 3075 3085 3095 3105 3115

EMCR CTRLEGLGGD -NVYCYNTDL IEGSKPYSIL QPNAYYKYDV K-NYVRFPEI LARGFGLRTI 

229E CTRLEGLGGN -NVYCYNTAL MEGSLPYSSI QANAYYKYDN G-NFIKLPEV IAQGFGFRTV

PEDV CTTLSGLGGT -AVYCYKNGL VEGAKLYSEL APHSYYKMVD G-NAVSLPEI ISRGFGIRTI 

TGEV CTTLTGLGGT -IVYCAKQGL VEGAKLYSDL MPDYYYEHAS G-NMVKLPAI IR-GLGLRFV 

OV43 CTMFTMADGS PQPYCYTEGL MQNASLYSSL VPHVRYNLAN AKGFIRFPEV LREGL-VRIV 

BoCoV CTMFAMADGS PQPYCYTDGL MQNASLYSSL VPHVRYNLAN AKGFIRLPEV LREGL-VRIV

MHV CTMLAHADGT PHPYCYTEGI MHNASLYDSL APHVRYNLAN SNGYIRFPEV VSEGI-VRIV 

AIBV CLYLTASNTP QLYCFNGDND APGALPFGSI IPHRVYFQPN GVRLIVPQQI LHTPY W 

SARS CoV CTIFKDAMGK PVPYCYDTNL LEGSISYSEL RPDTRYVLMD G-SIIQFPNT YLEGS-VRW 
3125 3135 3145 3155 3165 3175

EMCR RTLATRYCRV GECRDSHKGV CFGFDKWYVN DGRVD DG YICGDGLIDL LVNVLSIFSS 

229E RTIATKYCRV GECVESNAGV CFGFDKWFVN DGRVA NG YVCGTGLWNL VFNILSMFSS 

PEDV RTKAMTYCRV GQCVQSAEGV CFGADRFFVY NAESG SD FVCGTGLFTL LMNVISVFSK

TGEV KTQATTYCRV GECIDSKAGF CFGGDNWFVY DNEFG— NG YICGNSVLGF FKNVFKLFNS 

OV43 RTRSMSYCRV GLCEEADEGI CFNFNGSWVL NNDYYRSLPG TFCGRDVFDL IYQLFKGLAQ 

BoCoV RTRSMSYCRV GLCEEADEGI CFNFNGSWVL NNDYYRSLPG TFCGRDVFDL IYQLFKGLAQ

MHV RTRSMTYCRV GLCEDAEEGV CFNFNSSWVL NNPYYRAMPG TFCGRNAFDL IHQVLGGLVR 

AIBV KFVSDSYCRG SVCEYTRPGY CVSLNPQWVL FNDEYTSKPG VFCGSTVREL MFSMVSTFFT 

SARS CoV TTFDAEYCRH GTCERSEVGI CLSTSGRWVL NNEHYRALSG VFCGVDAMNL IANIFTPLVQ 
3185 3195 3205 3215 3225 3235

EMCR SFSVVAMSGH MLFNFLFAAF ITFLCFLVTK FKRVFGDLSY GVFTVVCATL INNISYWTQ 

229E SFSVAAMSGQ ILLNCALGAF AIFCCFLVTK FRRMFGDLSV GVCTVWAVL LNNVSYIVTQ 

PEDV TVPVTVLSGQ ILFNCIIAFV AVAVCFLFTK FKRMFGDMSV GVFTVGACTL LNNVSYIVTQ 

TGEV NMSWATSGA MLVNIIIACL AIAMCYGVLK FKKIFGDCTF LIVMIIVTLV VNNVSYFVTQ 

OV43 PVDFLALTAS SIAGAILAVI VVLVFYYLIK LKRAFGDYTS VVFVNVIVWC VNFMMLFVFQ 

BoCoV PVDFLALTAS SIAGAILAVI WLGFYYLIK LKRAFGDYTS IVFVNVIVWC VNFMMLFVFQ 

MHV PIDFFALTAS SVAGAILAII WLAFYYLIK LKRAFGDYTS WVINVIVWC INFLMLFVFQ 

AIBV GVN-PNIYMQ LATMFLILW VVLIFAMVIK FQGVFKAYAT TVFITMLVWV INAFILCVHS 

SARS CoV PVGALDVSAS WAGGIIAIL VTCAAYYFMK FRRVFGEYNH WAANALLFL MSFTILCLVP
3245 3255 3265 3275 3285 3295

EMCR N-LFFMLLYA ILYFVFTRTV R— YAWIWHI AYIVAYFLLI PWWLLTWFSF AAFLELLPNV 

229E N-LVTMIAYA ILYFFATRSL R--YAWIWCA AYLIAYISFA PWWLCAWYFL AMLTGLLPSL 

PEDV N-TLGMLGYA TLYFLCTKGV R— YMWIWHL GFLISYILIA PWWVLMVYAF SAIFEFMPNL 

TGEV N-TFFMIIYA IVYYFITRKL A — YPGILDA GFIIAYINMA PWYVITAYIL VFLYDSLPSL 

OV43 VYPILSCVYA ICYFYATLYF PSEISVIMHL QWLVMYGTIM PLWFCLLYIA WVSNHAFWV 

BoCoV VYPTLSCVYA ICYFYATLYF PSEISVIMHL QWLVMYGTIM PLWFCLLYIS WVSNHAFWV 

MHV VYPTLSCLYA CFYFYTTLYF PSEISWMHL QWLVMYGAIM PLWFCIIYVA WVSNHALWL 

AIBV YNSVLAVILL VLYCYASLVT SRNTVIIMHC WLVFTFGLIV PTWLACCYLG FIIYMYTPLF 

SARS CoV AYSFLPGVYS VFYLYLTFYF TNDVSFLAHL QWFAMFSPIV PFWITAIYVF CISLKHCHWF
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
3305 3315 3325 3335 3345 3355 

EMCR FKLKISTQ LFEGDKFI GTFESAAAGT FVLDMRSYER LINT— IS PE KLKNYAASYN 

229E LKLKVSTN LFEGDKFV GTFESAAAGT FVIDMRSYEK LANS — ISPE KLKSYAASYN 

PEDV FKLKVSTQ LFEGDKFV GSFENAAAGT FVLDMHAYEB LANS— ISTE KLRQYASTYN 

TGEV FKLKVSTN LFEGDKFV GNFESAAMGT FV I OMRS YET IVNS— TSIA RIKSYANSFN 

OV43 FSYCRKLG TSVR— SD GTFEEMALTT FMITKDSYCK LKNS— LSDV AFNRYLSLYN 

BoCoV FSYCRQLG TSVR— SD GTFEEMALTT FMITKDSYCK LKNS— LSDV AFNRYLSLYN

MHV FSYCRKLG TEVR— SD GTFEEMSLTT FMITKESYCK LKNS— VSDV AFNRYLSLYN 

AIBV LWCYGTTKNT RKLYDGNEFV GNYDLAAKST FVIRGSEFVK LTNE IGD KFEAYLSAYA 

SARS CoV FNNYLRKR VMFNGVTF STFEEAALCT FLLNKEMYLK LRSETLLPLT QYNRYLALYN



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3365 3375 3385 3395 3405 3415 

EMCR KYKYYSGSAS EADYRCACYA HLAKAMLDYA -KDHNDMLYS PPTISYN-ST LQSGLKKMAQ 

229E RYKYYSGNAN EADYRCACYA YLAKAMLDFS -RDHNDILYT PPTVSYG-ST LQAGLRKMAQ

PEDV KYKYYSGSAS EADYRLACFA HLAKAMMDYA -SNHNDTLYT PPTVSYN-ST LQAGLRKMAQ 

TGEV KYKYYTGSMG EADYRMACYA HLGKALMDYS -VNRTDMLYT PPTVSVN-ST LQSGLRKMAQ 

OV43 KYRYYSGKMD TAAYREAACS QLAKAMDTFT NNNGSDVLYQ PPTASVSTSF LQSGIVKMVN 

BoCoV KYRYYSGKMD TAAYREAACS QLAKAMDTFT NNNGSDVLYQ PPTASVSTSF LQSGIVKMVN 

MHV KYRYFSGKMD TAAYREAACS QLAKAMETFN HNNGNDVLYQ PPTASVTTSF LQSGIVKMVF 

AIBV RLKYYSGTGS EQDYLQACRA WLAYALDQYR -NSGVEIVYT PPRYSIGVSR LQSGFKKLVS 

SARS CoV KYKYFSGALD TTSYREAACC HLAKALNDFS -NSGADVLYQ PPQTSITSAV LQSGFRKMAF 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3425 3435 3445 3455 3465 3475 

EMCR PSGCVERCW RVCYGSTVLN GVWLGDTVTC PRHVIAPSTT VL-IDYDHAY STMRLHNFSV 

229E PSGFVEKCW RVCYGNTVLN GLWLGDIVYC PRHVIASNTT SA-IDYDHEY SIMRLHNFSI 

PEDV PSGVVEKCIV RVCYGNMALN GLWLGDIVMC PRHVIASSTT ST-IDYDYAL SVLRLHNFSI 

TGEV PSGLVEPCIV RVSYGNNVLN GLWLGDEVIC PRHVIASDTT RV- IN YEN EM SSVRLHNFSV 

OV43 PTSKVEPCVV SVTYGNMTLN GLWLDDKVYC PRHVICSASD MTNPDYTNLL CRVTSSDFTV 

BoCoV PTSKVEPCIV SVTYGNMTLN GLWLDDKVYC PRHVICSASD MTNPDYTNLL CRVTSSDFTV

MHV PTSKVEPCVV SVTYGNMTLN GLWLDDKVYC PRHVICSSAD MTDPDYSNLL CRVISSDFCV 

AIBV PSSAVEKCIV SVSYRGNNLN GLWLGDTIYC PRHVLGKFSG DQ WNDVL NLANNHEFEV 

SARS CoV PSGKVEGCMV QVTCGTTTLN GLWLDDTVYC PRHVICTAED MLNPNYEDLL IRKSNHSFLV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3485 3495 3505 3515 3525 3535 

EMCR SHNG-VFLGV VGVTMHGSVL RIKVSQSNVH TPKHVFKTLK PGASFNILAC YEGIASGVFG 

229E ISGT-AFLGV VGATMHGVTL KIKVSQTNMH TPRHSFRTLK SGEGFNILAC YDGCAQGVFG

PEDV SSGN-VFLGV VSATMRGALL QIKVNQNNVH TPKYTYRTVR PGESFNILAC YDGAAAGVYG 

TGEV SKNN-VFLGV VSARYKGVNL VLKVNQVNPN TPEHKFKSIK AGESFNILAC YEGCPGSVYG 

OV43 LFDR-LSLTV MSYQMRGCML VLTVTLQNSR TPKYTFGWK PGETFTVLAA YNGKPQGAFH 

BoCoV LFDR-LSLTV MSYQMQGCML VLTVTLQNSR TPKYTFGVVK PGETFTVLAA YNGKPQGAFH 

MHV MSGR-MSLTV MSYQMQGSLL VLTVTLQNPN TPKYSFGWK PGETFTVLAA YNGKSQGAFH 

AIBV TTQHGVTLNV VSRRLKGAVL I LQTAVANAE TPKYKFIKAN CGDSFTIACA YGGTWGLYP 

SARS CoV QAGN-VQLRV IGHSMQNCLL RLKVDTSNPK TPKYKFVRIQ PGQTFSVLAC YNGSPSGVYQ 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3545 3555 3565 3575 3585 3595 

EMCR VNLRTNFTIK GSFINGACGS PGYNVRNDGT VEFCYLHQIE LGSGAHVGSD FTGSVYGNFD 

229E VNMRTNWTIR GSFINGACGS PGYNLKN-GE VEFVYMHQIE LGSGSHVGSS FDGVMYGGFE

PEDV VNMRSNYTIR GSFINGACGS PGYNINN-GT VEFCYLHQLE LGSGCHVGSD LDGVMYGGYE 

TGEV VNMRSQGTIK GSFIAGTCGS VGYVLEN-GI LYFVYMHHLE LGNGSHVGSN FEGEMYGGYE 

OV43 VTMRSSYTIK GSFLCGSCGS VGYVIMG-DC VKFVYMHQLE LSTGCHTGTD FNGDFYGPYK 

BoCoV VTMRSSYTIK GSFLCGSCGS VGYVIMG-DC VKFVYMHQLE LSTGCHTGTD FNGDFYGPYK

MHV VTMRSSYTIK GSFLCGSCGS VGYVLTG-DS VRFVYMHQLE LSTGCHTGTD FSGNFYGPYR 

AIBV VTMRSNGTIR ASFLAGACGS VGFNIEK-GV VNFFYMHHLE LPNALHTGTD LMGEFYGGYV 

SARS CoV CAMRPNHTIK GSFLNGSCGS VGFNIDY-DC VSFCYMHHME LPTGVHAGTD LEGKFYGPFV 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3605 3615 3625 3635 3645 3655 

EMCR DQPSLQVESA NLMLSDNVVA FLYAALLNGC R WWL RSTRVNVDGF NEWAMANGYT 

229E DQPNLQVESA NQMLTVNWA FLYAAILNGC T WWL KGEKLFVEHY NEWAQANGFT

PEDV OQPTLQVEGA SSLFTENVLA FLYAALINGS T WWL SSSRIAVDRF NEWAVHNGMT 

TGEV DQPSMQLEGT NVMSSDNVVA FLYAALINGE R WFV TNTSMSLESY NTWAKTNSFT 

OV43 DAQWQLLIQ DYIQSVNFVA WLYAAILNNC N WFV QSDKCSVEDF NVWALSNGFS 

BoCoV DAQWQLPVQ DYIQSVNFVA WLYAAILNNC N WFV QSDKCSVEDF NVWALSNGFS

MHV DAQWQLPVQ DYTQTVNVVA WLYAAILNRC N WFV QSDSCSLEEF NVWAMTNGFS 

AIBV DEEVAQRVPP DNLVTNNIVA WLYAAIISVK ESSFSLPKWL ESTTVSVDDY NKWAGDNGFT 
SARS CoV DRQTAQAAGT DTTITLNVLA WLYAAVINGD R WFL NRFTTTLNDF NLVAMKYNYE



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
3665 3675 3685 3695 3705 3715 

EMCR IVSSVEC— Y SILAAKTGVS VEQLLASIQH LHE-GFGGKN ILGYSSLCDE FTLAEVVKQM 

229E AMNGEDA— F SILAAKTGVC VERLLHAIQV LNN-GFGGKQ ILGYSSLNDE FSINEWKQM 

PEDV TVGNTDC— F SILAAKTGVD VQRLLASIQS LHK-NFGGKQ ILGHTSLTDE FTTGEVVRQM 

TGEV ELSSTDA— F SMLAAKTGQS VEKLLDSIVR LNK-GFGGRT ILSYGSLCDE FTPTEVIRQM 

OV43 QVKSDLV— I DALASMTGVS LETLLAAIKR LKN-GFQGRQ IMGSCSFEDE LTPSDVYQQL 

BoCoV QVKSDLV— I DALASMTGVS LETLLAAIKR LKN-GFQGRQ IMGSCSFEDE LTPSDVYQQL 

MHV SIKADLV — L DALASMTGVT VEQILAAIKR LYS-GFQGKQ ILGSCVLEDE LTPSDVYQQL 

AIBV PFSTSTA — I TKLSAITGVD VCKLLRTIMV KNS-QWGGDP ILGQYNFEDE LTPESVFNQI 

SARS CoV PLTQDHVDIL GPLSAQTGIA VLDMCAALKE LLQNGMNGRT ILGSTILEDE FTPFDWRQC 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
3725 3735 3745 3755 3765 3775 

EMCR YGVNLQS GKVIFGLKTM FLFSVFFTMF WAELFIYTNT IWINPVILTP IFCLLLFLSL 

229E FGVNLQS GKTTSMFKSI SLFAGFFVMF WAELFVYTTT IWVNPGFLTP FMILLVALSL

PEDV YGVNLQG GYVSRACRNV LLVGSFLTFF WSELVSYTKF FWVNPGYVTP MFACLSLLSS 

TGEV YGVNLQA GKVKSFFYPI MTAMTILFAF WLEFFMYTPF TWINPTFVSI VLAVTTLIST 

OV43 AGIKLQSKRT RLFKGTVCWI MASTFLFSCI ITAFVKWTMF MYVTTNMFS- ITFCALCVIS 

BoCoV AGIKLQSKRT RLVKGIVCWI MASTFLFSCI ITAFVKWTMF MYVTTNMLS- ITFCALCVIS 

MHV AGVKLQSKRT RWKGTCCWI LASTLLFCSI ISAFVKWTMF MYVTTHMLG- VTLCALCFVS 

AIBV GGVRLQS SFVRKATSW FWSRCVLACF LFVLCAIVLF TAVPLKFYVY AAVILLMAVL 

SARS CoV SGVTFQGKFK KIVKGTHHWM LLTFLTSLLI LVQSTQWSLF FFVYENAFLP FTLGIMAIAA 



EMCR 

229E 

PEDV 

TGEV 

OV43

BoCoV 

MHV 

AIBV 

SARS CoV 



....|....|

3785 
VLTMFLKHKF 
CLTFWKHKV 
LLMFTLKHKT 
VFVSGIKHKM 
LAMLLVKKKH 
LAMLLVKHKH 
FAMLLVKHKH 
FISFTVKHVM 
CAMLLVKHKH 



....|....|

3795 
LFLQVFLLPT 
LFLQVFLLPS 
LFFQVFLIPA 
LFFMSFVLPS 
LYLTMYITPV 
LYLTMYIIPV 
LYLTMFIMPV 
AYMDTFLLPT 
AFLCLFLLPS 



....|....|

3805 
VIATALYN-- 
IIVAAIQN-- 
LIVTSCIN— 
VILVTAHN — 
LFTLLYNNY- 
LFTLLYNNY- 
LCTLFYTNY- 
LITVIIGVCA 
LATVAYFN— 



3815 
— CVLDYYIV 
— CAWDYHVT 
— LAFDVEVY 
--LFWDFSYY 
-LWYKHTFR 
-LWYKQTFR 
-LWYKQSFR 
EVPFIYNTLI 
-MVYMPASWV 



....|....|

3825 
KFLADHFN-Y 
KVLAEKFD-Y 
NYLAEHFD-Y 
ESLQSIVENT 
G YV YAWLS YY 
G YV YAWLS YY 
GLAYAWLSHF 
SQWIFLSQW 
MRIMTWLELA 



....|....|



3835 
NVSVLQMDVQ 
NVSVMQMDIQ 
HVSLMGFNAQ 
NTMFLPVDMQ 
VPSVEYTYTD 
VPSVEYTYTD 
VPAVDYTYMD 
YDPWFDTMV 
DTSLSGYRLK 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3845 3855 3865 3875 3885 3895 

EMCR GLVNVLVCLF WFLH TW RFSKERFTHW FTYVCSLIAV AYTYFYSGD F 

229E GFVNIFICLF VALLH TW RFAKERCTHW CTYLFSLIAV LYTALYSYD Y 

PEDV GLVNIFVCFV VTILHGTYTW RFFN-TPASS VTYWALLTA AYNYFYASD 1 

TGEV GVMLTVFCFI VFVTYSVRFF TCKQSWFSLA VTTILVIFNM VKIFGTSDEP WTENQIAFCF 

OV43 EVIYGMLLLV GMVFVTLRSI NHDLFS FIMF VGRLISVFSL WYKGSNLEEE I 

BoCoV EVIYGMLLLI GMVFVTLRSI NHDLFS FIMF VGRVISWSL WYMGSNLEEE 1 

MHV EVLYGVVLLV AMVFVTMRSI NHDVFSVMFL VGRLVSLVSM WYFGANLEEE V 

AIBV PWMFLPLVLY TAFKCVQGCY MNSFNTSLLM LYQFVKLGFV IYTSSNTLTA YTEGNWELFF 
SARS CoV DCVMYASALV LLI LMTARTV YDDAARRVWT LMNVITLVYK VYYGNALDQA 1 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3905 3915 3925 3935 3945 3955 

EMCR LSLLVMFLCA ISSDWYIGAI VFRLSRLIIF FSPE— SVFS VFGDVKLTLV VYLICGYLVC 

229E VSLLVMLLCA ISNEWYIGAI IFRICRFGVA FLPV— EYVS YFDGVKTVLL FYMLLGFVSC 

PEDV LSCAMTLFAS VTGNWFVGAV CYKVAVYMAL RFP TFVA IFGDIKSVMF CYLVLGYFTC 

TGEV VNMLTMIVSL TTKDWMWIA SYRIAYYIW CVMP-SAFVS DFGFMKCISI VYMACGYLFC 

OV43 LLMLASLFGT YTWT---TVL SMAVAKVIAK WVAVNVLYFT DIPQIKIVLL CYLFIGYIIS 

BoCoV LLMLASLFGT YTWT TAL SMAAAKVIAK WVAVNVLYFT DIPQIKIVLV CYLFIGYIIS 

MHV LLFLTSLFGT YTWT TML SLATAKVIAK WLAVNVLYFT DVPQVKLVLL SYLCIGYVCC 

AIBV ELVHTTVLAN VSSNSLIGLF VFKCAKWMLY YCN AT YLNNYVLMAV MVNCIGWLCT 

SARS CoV SMWALVISVT SNYSGVVTTI MFLARAIVFV CVEYYPLLFI TGNTLQCIML VYCFLGYCCC 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

3965 3975 3985 3995 4005 4015 

EMCR TYWGILYWFN RFFKCTMGVY DFKVSAAEFK YMVANGLHAP YGPFDALWLS FKLLGIGGDR 

229E MYYGLLYWIN RFCKCTLGVY DFCVSPAEFK YMVANGLNAP NGPFDALFLS FKLMGIGGPR

PEDV CFYGILYWFN RFFKVSVGVY DYTVSAAEFK YMVANGLRAP TGTLDSLLLS AKLIGIGGER 

TGEV CYYGILYWVN RFTCMTCGVY QFTVSAAELK YMTANNLSAP KNAYDAMILS AKLIGVGGKR 

OV43 CYWGLFSLMN SLFRMPLGVY NYKISVQELR YMNANGLRPP KNSFEALMLN FKLLGIGGVP 

BoCoV CYWGLFSLMN SLFRMPLGVY NYKISVQELR YMNANGLRPP KNSFEALMLN FKLLGIGGVP 

MHV CYWGVLSLLN SIFRMPLGVY NYKISVQELR YMNANGLRPP RNSFEALVLN FKLLGIGGVP 

AIBV CYFGLYWWVN KVFGLTLGKY NFKVSVDQYR YMCLHKINPP KTVWEVFSTN ILIQGIGGDR 

SARS CoV CYFGLFCLLN RYFRLTLGVY DYLVSTQEFR YMNSQGLLPP KSSIDAFKLN IKLLGIGGKP 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4025 4035 4045 4055 4065 4075 

EMCR CIKISTVQSK LTDLKCTNW LLGCLSSMNI AANSSEWAYC VDLHNKINLC DDPEKAQGML 

229E TIKVSTVQSK LTDLKCTNW LMGILSNMNI ASNSKEWAYC VEMHNKINLC DDPETAQELL

PEDV NIKISSVQSK LTDIKCSNW LLGCLSSMNV SANSTEWAYC VDLHNKINLC NDPEKAQEML 

TGEV NIKISTVQSK LTEMKCTNW LLGLLSKMHV ESNSKEWNYC VGLHNEINLC DDPEIVLEKL 

OV43 IIEVSQFQSK LTDVKCANW LLNCLQHLHV ASNSKLWHYC STLHNEILAT SDLSVAFEKL 

BoCoV IIEVSQFQSK LTDVKCANGG LLNCLQHLHV ASNSKLWQYC STLHNEILAT SDLGVAFEKL

MHV VIEVSQIQSR LTDVKCVNW LLNCLQHLHI ASSSKLWQYC STLHNEILAT SDLSVAFDKL 

AIBV VLPIATVQAK LSDVKCTTVV LMQLLTKLNV EANSKMHVYL VELHNKILAS DDVGECMDNL 

SARS CoV CIKVATVQSK MSDVKCTSVV LLSVLQQLRV ESSSKLWAQC VQLHNDILLA KDTTEAFEKM



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4085 4095 4105 4115 4125 4135 

EMCR LALLAFFLSK HSDFG LDGLIDSYF DNSSTLQSVA SSFVSMPSYI AYENARQAYE 

229E LALLAFFLSK HSDFG LGDLVDSYF ENDSILQSVA SSFVGMPSFV AYETARQEYE 

PEDV LALLAFFLSK NSAFG- LDDLLESYF NDNSMLQSVA STYVGLPSYV IYENARQQYE 

TGEV LALIAFFLSK HNTCD LSELIESYF ENTTILQSVA SAYAALPSWI ALEKARADLE 

OV43 AQLLIVLFAN PAAVDSKCLT SIEEVCDDYA KDNTVLQALQ SEFVNMASFV EYEVAKKNLD 

BoCoV AQLLIVLFAN PAAVDSKCLT SIEEVCDDYA KDNTVLQALQ SEFVNMASFV EYEVAKKNLD

MHV AQLLWLFAN PAAVDS KCLA SIEEVSDDYV RDSTVLQALQ SEFVNMASFV EYELAKKNLD 

AIBV LGMLITLFCI DSTID LSEYCDDIL KRSTVLQSVT QEFSHIPSYA EYERAKNLYE 

SARS CoV VSLLSVLLSM QGAVD INRLCEEML DNRATLQAIA SEFSSLPSYA AYATAQEAYE 
4145 4155 4165 4175 4185 4195

EMCR DAIANGSS SQLIKQLK RAMNIAKSEF DHEISVQKKI NRMAEQAATQ MYKEARSVNR 

229E NAVANGSS PQIIKQLK KAMNVAKAEF DRESSVQKKI NRMAEQAAAA MYKEARAVNR 

PEDV DAVNNGSP PQLVKQLR HAMNVAKSEF OREASTQRKL DRMAEQAAAQ MYKEARAVNR 

TGEV EAKKNDVS PQILKQLT KAFNIAKSDF EREASVQKKL OKMAEQAAAS MYKEARAVOR 

OV43 EARFSGSAN QQQLKQLE KACNIAKSAY ERDRAVAKKL ERMADLALTN MYKEARINDK 

BoCoV EACSSGSAN QQQLKQLE KACNIAKSAY ERDRAVARKL ERMADLALTN MYKEARINDK 

MHV EAKASGSAN QQQIKQLE KACNIAKSAY ERDRAVARKL ERMADLALTN MYKEARINDK 

AIBV KVLVDSKNGG VTQQELAAYR KAANIAKSVF DRDLAVQKKL DSMAERAMTT MYKEARVTDR 

SARS CoV QAVANGDS EWLKKLK KSLNVAKSEF DRDAAMQRKL EKMADQAMTQ MYKQARSEDK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4205 4215 4225 4235 4245 4255 

EMCR KSKVISAMHS LLFGMLRRLD MSSVETVLNL ARDGVVPLSV IPATSASKLT IVSPDLESYS 

229E KSKVVSAMHS LLFGMLRRLD MSSVDTILNM ARNGWPLSV I PATSAARLV VWPDHDSFV 

PEDV KSKVVSAMHS LLFGMLRRLD MSSVDTILNL AKDGWPLSV I PAVSATKLN IVTSDIDSYN

TGEV KSKIVSAMHS LLFGMLKKLD MSSVNTIIDQ ARNGVLPLSI I PAASATRLV VITPSLEVFS 

OV43 KSKVVSALQT MLFSMVRKLD NQALNSILDN AVKGCVPLNA IPSLAANTLN IIVPDKSVYD 

BoCoV KSKVVSALQT MLFSMVRKLD NQALNSILDN AVKGCVPLNA IPSLAANTLT IIVPDKSVYD 

MHV KSKVVSALQT MLFSMIRKLD NQALNSILDN AVKGCVPLNA IPSLTSNTLT IIVPDKQVFD 

AIBV RAKLVSSLHA LLFSMLKKID SEKLNVLFDQ ASSGWPLAT VPIVCSNKLT LVIPDPETWV 

SARS CoV RAKVTSAMQT MLFTMLRKLD NDALNNIINN ARDGCVPLNI IPLTTAAKLM VWPDYGTYK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
4265 4275 4285 4295 4305 4315 

EMCR KIVCDGSVHY AGWWTLNDV KDNDGRPVHV KEITRENVET LT WPL ILNCERWK- 

229E KMMVDGFVHY AGWWTLQEV KDNDGKNVHL KDVTKENQEI LV WPL ILTCERWK- 

PEDV RIQREGCVHY AGTIWNIIDI KDNDGKVVHV KEVTAQNAES LS WPL VLGCERIVK- 

TGEV KIRQENNVHY AGAIWTIVEV KDANGSHVHL KEVTAANELN LT WPL SITCERTTK- 

OV43 QVVDNVYVTY AGNVWQIQTI QDSDGTNKQL NEISDDCN WPL VI IANRYNE- 

BoCoV QVVDNVYVTY AGNVWQIQTI QDSDGTNKQL HEISDDCN WPL VIIANRHNE-

MHV QVVDNVYVTY AGNVWHIQSI QDADGAVKQL NEIDVNIT WPL VIAANRHNE- 

AIBV KCVEGVHVTY STVVWNIDTV IDADGTELHP TSTGSGLTYC ISGANIAWPL KVNLTRNGHN 
SARS CoV NTCDGNTFTY ASALWEIQQV VDADSKIVQL SEINMDNSPN LA WPL IVTALRAN-- 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4325 4335 4345 4355 4365 4375 

EMCR LQ-NNE IMPGKLKQKP MKAEG — DGG VLGDGNALYN TEGGKTFMYA YISNKADLKF 

229E LQ-NNE IMPGKMKVKA TKGEG— DGG ITSEGNALYN NEGGRAFMYA YVTTKPGMKY 

PEDV LQ-NNE I I PGKLKQRS IKAEG — DG- IVGEGKALYN NEGGRTFMYA FISDKPDLRV 

TGEV LQ-NNE IMPGKLKERA VRASATLDGE AFGSGKALMA SESGKSFMYA FIASDNNLKY 

OV43 VSATVLQNNE LMPAKLKIQV VNSGPDQTCN TPT — QCYYN NSNNGKIVYA ILSDVDGLKY 

BoCoV VSATVLQNNE LMPAKLKTQV VNSGPDQTCN TPT— QCYYN NSYNGKIVYA ILSDVDGLKY 

MHV VSSWLQNNE LMPQKLRTQV VNSGSDMNCN TPT — QCYYN TTGMGKIVYA ILSDCDGLKY 

AIBV KVDVVLQNNE LMPHGVKTKA CVAGVDQAHC SVES-KCYYT NISGNSWAA ITSSNPNLKV 

SARS CoV -SAVKLQNNE LSPVALRQMS CAAGTTQTAC TDDNALAYYN NSKGGRFVLA LLSDHQDLKW 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
4385 4395 4405 4415 4425 4435 

EMCR VKWEYEGG-- CNTIELDSPC RFMVETPNGP QVKYLYFVKN LNTLRRGAVL GFIGATIRLQ 

229E VKWEHDSG — VVTVELEPPC RFVIDTPTGP QIKYLYFVKN LNNLRRGAVL GYIGATVRLQ

PEDV VKWEFDGG — CNTIELEPPR KFLVDSPNGA QIKYLYFVRN LNTLRRGAVL GYIGATVRLQ 

TGEV VKWESNND-- IIPIELEAPL RFYVDGANGP EVKYLYFVKN LNTLRRGAVL GYIGATVRLQ 

OV43 TKILKDDGN- FWLELDPPC KFTVQDAKGL KIKYLYFVKG CNTLARGWW GTISSTVRLQ 

BoCoV TKILKDDGN- FVVLELDPPC KFTVQDVKGL KIKYLYFVKG CNTLARGWW GTISSTVRLQ

MHV TKIVKEDGN- CVVLELDPPC KFSVQDVKGL KIKYLYFVKG CNTLARGWW GTLSSTVRLQ 

AIBV ASFLNEAGN- QIYVDLDPPC KFGMKVGVKV EWYLYFIKN TRSIVRGMVL GAISNVVVLQ 

SARS CoV ARFPKSDGTG TIYTELEPPC RFVTDTPKGP KVKYLYFIKG LNNLNRGMVL GSLAATVRLQ 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4445 4455 4465 4475 4485 4495 

EMCR AG-KQTELAV NSGLLTACAF SVDPATTYLE AVKHGAKPVS NCIKMLSNGA GNGQAITTSV 

229E AG-KQTEFVS NSHLLTHCSF AVDPAAAYLD AVKQGAKPVG NCVKMLTNGS GSGQAITCTI

PEDV AG-KQTEQAI NSSLLTLCAF AVDPAKTYID AVKSGHKPVG NCVKMLANGS GNGQAVTNGV 

TGEV AG-KPTEHPS NSSLLTLCAF SPDPAKAYVD AVKRGMQPVN NCVKMLSNGA GNGMAVTNGV 

OV43 AG-TATEYAS NSSILSLCAF SVDPKKTYLD FIQQGGTPIA NCVKMLCDHA GTGMAITVKP 

BoCoV AG-TATEYAS NSSILSLCAF SVDPKKTYLD FIQQGGTPIA NCVKMLCDHA GTGMAITVKP

MHV AG-TATEYAS NSAIRSLCAF SVDPKKTYLD YIQQGGAPVT NCVKMLCDHA GTGMAITIKP 

AIBV SKGHETEEVD AVGILSLCSF AVDPADTYCK YVAAGNQPLG NCVKMLTVHN GSGFAITSKP 

SARS COV AG-NATEVPA NSTVLSFCAF AVDPAKAYKD YLASGGQPIT NCVKMLCTHT GTGQAITVTP 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
4505 4515 4525 4535 4545 4555 

EMCR DANTNQDSYG GASICLYCRA HVPHP S MDGYCKFKGK CVQVP-IGCL DPIRFCLENN 

229E DSNTTQDTYG GASVCIYCRA HVAHP-' T MDGFCQYKGK WVQVP-IGTN DPIRFCLENT

PEDV EASTNQDSYG GASVCLYCRA HVEHP S MDGFCRLKGK YVQVP-LGTV DPIRFVLEND 

TGEV EANTQQDSYG GASVCIYCRC HVEHP A IDGLCRYKGK FVQIP-TGTQ DPIRFCIENE 

OV43 DATTSQDSYG GASVCIYCRA RVEHP D VDGLCKLRGK FVQVP-VGIK DPVSYVLTHD 

BoCoV DATTSQDSYG GASVCIYCRA RVEHP D VDGLCKLRGK FVQVP-VGIK DPVSYVLTHD 

MHV EATTNQDSYG GASVCIYCRS RVEHP D VDGLCKLRGK FVQVP-LGIK DPVSYVLTHD 

AIBV SPTPDQDSYG GASVCLYCRA HIAHPGSVGN LDGRCQFKGS FVQIP-TTEK DPVGFCLRNK 
SARS CoV EANMDQESFG GASCCLYCRC HIDHP N PKGFCDLKGK YVQI PTTCAN DPVGFTLRNT 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4565 4575 4585 4595 4605 4615 

EMCR VCNVCGCWLG HGCACDRTTI QSVDIS YLNRARGSSA -ARLEPCN-G TDIDKCVRAF 

229E VCKVCGCWLN HGCTCDRTAI QSFDHS YLNRVRGSSA -ARLEPCN-G TDIDYCVRAF

PEDV VCKVCGCWLS NGCTCDRSIM QSTDYG LFKRVRGSSA -ARLEPCN-G TDTQHVYRAF 

TGEV VCWCGCWLN NGCMCDRTSM QSFTV0QSY- LFKRVRGSSA -ARLEPCN-G TDPDHVSRAF 

OV43 VCRVCGFWRD GSCSCVSTDT TVQSKDTN— FFKRVRGTSV DARLVPCASG LSTDVQLRAF 

BoCoV VCQVCGFWRD GSCSCVSTDT TVQSKDT FFKRVRGTSV DARLVPCASG LSTDVQLRAF 

MHV VCQVCGFWRD GMFLCR-HRL PVSVKRHE-- LFKRVRGTSV NARLVPCASG LDTDVQLRAF 

AIBV VCTVCQCWIG YGCQCDSLRQ PKSSVQS VAGASD FDKNYLNG — YGVAVRLGMF 

SARS CoV VCTVCGMWKG YGCSCDQLRE PLMQSADAST FLNRVCGVSA -ARLTPCGTG TSTDWYRAF

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4625 4635 4645 4655 4665 4675 

EMCR DIYNKNVSFL GKCLKMNCVR FKNADLK DGYFVIKRC TKSVMEHEQS HYNLLNFSGA 

229E DVYNKDASFI GKNLKSNCVR FKNVDKD DAFYIVKRC IKSVMDHEQS MYNLLKGCNA 

PEDV DIYNKDVACL GKFLKVNCVR LKNLDKH DAFYWKRC TKSAMEHEQS IYSRLEKCGA 

TGEV DIYNKDVACI GKFLKTNCSR FRNLDKH DAYYIVKRC TKTVMDHEQV CYNDLKDSGA 

OV43 DIYNASVAGI GLHLKVNCCR FQRVDENGDK LDQFFWKRT DLTIYNREMK CYERVKDCKF 

BoCoV DICNASVAGI GLHLKVNCCR FQRVDENGDK LDQFFWKRT DLTIYNREME CYERVKDCKF 

MHV OICNANRAGI GLYYKVNCCR FQRADEDGNT LDKFFVIRRT NLEVYNKEKE CYELTKECGV 

AIBV QNLKRNCARF QELRDTEDGN LEYLDS YFWKQT TPSNYEHEKS CYEDLKS-EV 

SARS CoV DIYNEKVAGF AKFLKTNCCR FQEKDEEGNL LDSYFWKRH TMSNYQHEET IYNLVKDCPA 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
4685 4695 4705 4715 4725 4735 

EMCR LAEHDFFTWK DGRVIYGNVS RHNLTKYTMM DLVYAMRNFD EQNCDVLKEV LVLTGCCDNS 

229E VAKHDFFTWH EGRTIYGNVS RQDLTKYTMM DLCFALRNFD EKDCEVFKEI LVLTGCCSTD

PEDV IAEHDFFTWK DGRAIYGNVC RKDLTEYTMM DLCYALRNFD ENNCDVLKSI LIKVGACEES 

TGEV VAEHDFFTYK EGRCEFGNVA RRNLTKYTMM DLCYAIRNFD EKNCEVLKEI LVTVGACTEE 

OV43 VAEHDFFTFD VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS 

BOCOV VAEHDFFTFD VEGSRVPHIV RKDLTKYTML DLCYALRHFD RNDCMLLCDI LSIYAGCEQS 

MHV VAEHEFFTFD VEGSRVPHIV RKDLSKYTML DLCYALRHFD RNDCSTLKEI LLTYAECDES 

AIBV TADHDFFVFN KN— IYNIS RQRLTKYTMM DFCYALRHFD PKDCEVLKEI LVTYGCIEDY 

SARS CoV VAVHDFFKFR VDGDMVPHIS RQRLTKYTMA DLVYALRHFD EGNCDTLKEI LVTYNCCDDD 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4745 4755 4765 4775 4785 4795 

EMCR YFDSKG WYDPVENEDI HRVYASLGKI VARAMLKCVA LCDAMVAKGV VGVLTLDNQD 

229E YFEMKN WFDPIENEDI HRVYAALGKV VANAMLKCVA FCDEMVLKGV VGVLTLDNQD 

PEDV YFNNKV WFDPVENEDI HRVYALLGTI VARAMLKCVK FCDAMVEQGI VGWTLDNQD 

TGEV FFENKD WFDPVENEAI HEVYAKLGPI VANAMLKCVA FCDAIVEKGY -IGVITLDNQD 

OV43 YFTKKD WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGVLTLDNQD 

BOCOV ——YFTKKD WYDFVENPDI INVYKKLGPI FNRALVSATE FADKLVEVGL VGILTLDNQD 

MHV YFQKKD WYDFVENSDI INVYKKLGPI FNRALLNTAK FADTLVEAGL VGVLTLDNQD 

AIBV HPKWFEENKD WYDPIENSKY YVMLAKMGPI VRRALLNAIE FGNLMVEKGY VGVITLDNQD 

SARS COV YFNKKD WYDFVENPDI LRVYANLGER VRQSLLKTVQ FCDAMRDAGI VGVLTLDNQD 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
4805 4815 4825 4835 4845 4855 

EMCR LNGNFYDFGD FVVSLPNMGV PCCTSYYSYM MPIMGLTNCL ASECFVKSDI FGSDFKTFDL 

229E LNGNFYDFGD FVLCPPGMGI PYCTSYYSYM MPVMGMTNCL ASECFMKSDI FGQDFKTFDL 

PEDV LNGDFYDFGD FTCSIKGMGV PICTSYYSYM MPVMGMTNCL ASECFVKSDI FGEDFKSYDL 

TGEV LNGNFYDFGD FVKTAPGFGC ACVTSYYSYM MPLMGMTSCL ESENFVKSDI YGSDYKQYDL 

OV43 LNGKWYDFGD YVIAAPGCGV AIADSYYSYI MPMLTMCHAL DCELYVNN— — -AYRLFDL 

BOCOV LNGKWYDFGD YVIAAPGCGV AIADSYYSYM MPMLTMCHAL DCELYVNN AYRLFDL 

MHV LYGQWYDFGD FVKTVPGCGV AVADSYYSYM MPMLTMCHAL DSELFING- TYREFDL 

AIBV LNGKFYDFGD FQKTAPGAGV PVFDTYYSYM MPIIAMTDAL APERYFEYDV H-KGYKSYDL 

SARS CoV LNGNWYDFGD FVQVAPGCGV PIVDSYYSLL MPILTLTRAL AAESHMDADL A-KPLIKWDL 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4865 4875 4885 4895 4905 4915 

EMCR LKYDFTEHKE NLFNKYFKHW SFDYHPNCSD CYDDMCVIHC ANFNTLFATT IPGTAFGPLC 

229E LKYDFTEHKE VLFNKYFKYW GQDYHPDCVD CHDEMCILHC SNFNTLFATT IPNTAFGPLC 

PEDV LEYDFTEHKT ALFNKYFKYW GLQYHPNCVD CSDEQCIVHC ANFNTLFSTT IPITAFGPLC 

TGEV LAYDFTEHKE YLFQKYFKYW DRTYHPNCSD CTSDECIIHC ANFNTLFSMT IPMTAFGPLV 

OV43 VQYDFTDYKL ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV 

BOCOV VQYDFTDYKL ELFNKYFKHW SMPYHPNTVD CQDDRCIIHC ANFNILFSMV LPNTCFGPLV 

MHV VQYDFTDFKL ELFNKYFKYW SMTYHPNTCE CEDDRCIIHC ANFNILFSMV LPKTCFGPLV 

AIBV LKYDYTEEKQ ELFQKYFKYW DQEYHPNCRD CSDDRCLIHC ANFNILFSTL IPQTSFGNLC 

SARS COV LKYDFTEERL CLFDRYFKYW DQTYHPNCIN CLDDRCILHC ANFNVLFSTV FPPTSFGPLV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4925 4935 4945 4955 4965 4975 

EMCR RKVFIDGVPL VTTAGYHFKQ LGLVWNKDVN THSVRLTITE LLQFVTDPSL IIASSPALVD 

229E RKVFIDGVPV VATAGYHFKQ LGLVWNKDVN THSTRLTITE LLQFVTDPTL IVASSPALVD

PEDV RKCWIDGVPL VTTAGYHFKQ LGIVWNNDLN LHSSRLSINE LLQFCSDPAL LIASSPALVD 

TGEV RKVHIDGVPV WTAGYHFKQ LGIVWNLDVK LDTMKLSMTD LLRFVTDPTL LVASSPALLD 

OV43 RQIFVDGVPF WSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD 

BoCoV RQIFVDGVPF WSIGYHYKE LGIVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALYD 

MHV RQIFVDGVPF WSIGYHYKE LGVVMNMDVD THRYRLSLKD LLLYAADPAL HVASASALLD 

AIBV RKVFVDGVPF IATCGYHSKE LGVIMNQDNT MSFSKMGLSQ LMQFVGDPAL LVGTSNNLVD 

SARS CoV RKIFVDGVPF WSTGYHFRE LGWHNQDVN LHSSRLSFKE LLVYAADPAM HAASGNLLLD 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

4985 4995 5005 5015 5025 5035 

EMCR QRTICFSVAA LSTGLTNQW KPGHFNEEFY NFLRLRGFFD EGSELTLKHF FFAQNGDAAV 

229E KRTVCFSVAA LSTGLTSQTV KPGHFNKEFY DFLRSQGFFD EGSELTLKHF FFTQKGDAAI 

PEDV QRTVCFSVAA LGTGMTNQTV KPGHFNKEFY DFLLEQGFFS EGSELTLKHF FFAQKVDAAV 

TGEV QRTVCFSIAA LSTGITYQTV KPGHFNKDFY DFITERGFFE EGSELTLKHF FFAQGGEAAM 

OV43 LRTCCFSVAA ITSGVKFQTV KPGNFNQDFY DFVLSKGLLK EGSSVDLKHF FFTQDGNAAI 

BOCOV LRTCCFSVAA ITSGVKFQTV KPGNFNQDFY DFILSKGLLK EGSSVDLKHF FFTQDGNAAI 

MHV LRTCCFSVAA ITSGVKFQTV KPGNFNQDFY EFILSKGLLK EGSSVDLKHF FFTQDGNAAI 

AIBV LRTSCFSVCA LTSGITHQTV KPGHFNKDFY DFAEKAGMFK EGSSIPLKHF FYPQTGNAAI 

SARS CoV KRTTCFSVAA LTNNVAFQTV KPGNFNKDFY DFAVSKGFFK EGSSVELKHF FFAQDGNAAI 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5045 5055 5065 5075 5085 5095 

EMCR KDFDFYRYNK PTILDICQAR VTYKIVSRYF DIYEGGCIKA CEWVTNLNK SAGWPLNKFG 

229E KDFDYYRYNR PTMLDIGQAR VAYQVAARYF DCYEGGCITS REVWTNLNK SAGWPLNKFG 

PEDV KDFDYYRYNR PTVLDICQAR WYQIVQRYF DIYEGGCITA KEVWTNLNK SAGYPLNKFG 

TGEV TDFNYYRYNR VTVLDICQAQ FVYKIVGKYF ECYDGGCINA REVWTNYDK SAGYPLNKFG 

OV43 TDYNYYKYNL PTMVDIKQLL FVLEVVYKYF EIYDGGCIPA SQVIVNNYDK SAGYPFNKFG 

BoCoV TDYNYYKYNL PTMVDIKQLL FVLEVVYKYF EIYDGGCIPA AQVIVNNYDK SAGYPFNKFG 

MHV TDYNYYKYNL PTMVDIKQLL FVLEWNKYF EIYDGGCIPA TQVIVNNYDK SAGYPFNKFG 

AIBV NDYDYYRYNR PTMFDICQLL FCLEVTSKYF ECYEGGCIPA SQWVNNLDK SAGYPFNKFG 

SARS COV SDYDYYRYNL PTMCDIRQLL FWEVVDKYF DCYDGGCINA NQVIVNNLDK SAGFPFNKWG 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5105 5115 5125 5135 5145 5155 

EMCR KASLYYESIS YEEQDALFAL TKRNVLPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ

229E KAGLYYESIS YEEQDAIFSL TKRNILPTMT QLNLKYAISG KERARTVGGV SLLATMTTRQ

PEDV KAGLYYESLS YEEQDELYAY TKRNILPTMT QLNLKYAISG KERARTVGGV SLLSTMTTRQ

TGEV KARLYYETLS YEEQDALFAL TKRNVLPTMT QMNLKYAISG KARARTVGGV SLLSTMTTRQ

OV43 KARLYYEALS FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM 

BoCoV KARLYYEALS FEEQDEIYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM 

MHV KARLYYEALS FEEQDEVYAY TKRNVLPTLT QMNLKYAISA KNRARTVAGV SILSTMTGRM 

AIBV KARLYYEMS- LEEQDQLFEI TKKNVLPTIT QMNLKYAISA KNRARTVAGV SILSTMTNRQ 

SARS CoV KARLYYDSMS YEDQDALFAY TKRNVIPTIT QMNLKYAISA KNRARTVAGV SICSTMTNRQ 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
5165 5175 5185 5195 5205 5215 

EMCR YHQKHLKSIV NTRNATWIG TTKFYGGWNN MLRTLIDGVE NPMLMGWDYP KCDRALPNMI 

229E FHQKCLKSIV ATRNATVVIG TTKFYGGWDN MLKNLMADVD DPKLMGWDYP KCDRAMPSMI

PEDV YHQKHLKSIV NTRGASWIG TTKFYGGWDN MLKNLIDGVE NPCLMGWDYP KCDRALPNMI

TGEV YHQKHLKSIA ATRNATVVIG STKFYGGWDN MLKNLMRDVD NGCLMGWDYP KCDRALPNMI 

OV43 FHQKCLKSIA ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNLL 

BoCoV FHQKCLKSIA ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD NPVLMGWDYP KCDRAMPNIL 

MHV FHQKCLKSIA ATRGVPVVIG TTKFYGGWDD MLRRLIKDVD SPVLMGWDYP KCDRAMPNIL 

AIBV FHQKILKSIV NTRNASWIG TTKFYGGWDN MLRNLIQGVE DPILMGWDYP KCDRAMPNLL 

SARS CoV FHQKLLKSIA ATRGATVVIG TSKFYGGWHN MLKTVYSDVE TPHLMGWDYP KCDRAMPNML 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5225 5235 5245 5255 5265 5275 

EMCR RMISAMVLGS KHVNCCTVTD RFYRLGNELA QVLTEVVYSN GGFYFKPGGT TSGDASTAYA 

229E RMLSAMILGS KHVTCCTASD KFYRLSNELA QVLTEVVYSN GGFYFKPGGT TSGDATTAYA 

PEDV RMISAMILGS KHTTCCSSTD RFFRLCNELA QVLTEVVYSN GGFYLKPGGT TSGDATTAYA 

TGEV RMASAMILGS KHVGCCTHND RFYRLSNELA QVLTEWHCT GGFYFKPGGT TSGDGTTAYA 

OV43 RIVSSLVLAR KHETCCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA 

BoCoV RIVSSLVLAR KHEACCSQSD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA 

MHV RIISSLVLAR KHDSCCSHTD RFYRLANECA QVLSEIVMCG GCYYVKPGGT SSGDATTAFA 

AIBV RIAASLVLAR KHTNCCSWSE RIYRLYNECA QVLSETVLAT GGIYVKPGGT SSGDATTAYA 

SARS CoV RIMASLVLAR KHNTCCNLSH RFYRLANECA QVLSEMVMCG GSLYVKPGGT SSGDATTAYA 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5285 5295 5305 5315 5325 5335 

EMCR NSIFNIFQAV SSNINRLLSV PSDSCNNVNV RDLQRRLYDN CYRLTSVEES FIDDYYGYLR 

229E NSVFNIFQAV SSNINCVLSV NSSNCNNFNV KKLQRQLYDN CYRNSNVDES FVDDFYGYLQ 

PEDV NSVFNIFQAV SANVNKLLSV DSNVCHNLEV KQLQRKLYEC CYRSTIVDDQ FWEYYGYLR 

TGEV NSAFNIFQAV SANVNKLLGV DSNACNNVTV KSIQRKIYDN CYRSSSIDEE FWEYFSYLR 

OV43 NSVFNICQAV SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDKVDST FVTEYYEFLN 

BoCoV NSVFNICQAV SANVCALMSC NGNKIEDLSI RALQKRLYSH VYRSDMVDST FVTEYYEFLN 

MHV NSVFNICQAV SANVCSLMAC NGHKIEDLSI RELQKRLYSN VYRADHVDPA FVNEYYEFLN 

AIBV NSVFNIIQAT SANVARLLSV ITRDIVYDNI KSLQYELYQQ VYRRVNFDPA FVEKFYSYLC 

SARS COV NSVFNICQAV TANVNALLST DGNKIADKYV RNLQHRLYEC LYRNRDVDHE FVDEFYAYLR 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5345 5355 5365 5375 5385 5395 

EMCR KHFSMMILSD DGVVCYNKDY AELGYIADIS AFKATLYYQN NVFMSTSKCW VEEDLTKGPH 

229E KHFSMMILSD DSVVCYNKTY AGLGYIADIS AFKATLYYQN GVFMSTAKCW TEEDLSIGPH

PEDV KHFSMMILSD DGVVCYNNDY ASLGYVADLN AFKAVLYYQN NVFMSASKCW IEPDINKGPH 

TGEV KHFSMMILSD DGVVCYNKDY ADLGYVADIN AFKATLYYQN NVFMSTSKCW VEPDLSVGPH 

OV43 KHFSMMILSD DGVVCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VEHDINNGPH 

BoCoV KHFSMMILSD DGVVCYNSDY ASKGYIANIS AFQQVLYYQN NVFMSESKCW VENDINNGPH 

MHV KHFSMMILSD DGVVCYNSEF ASKGYIANIS AFQQVLYYQN NVFMSEAKCW VETDIEKGPH 

AIBV KNFSLMILSD DGVVCYNNTL AKQGLVADIS GFREVLYYQN NVFMADSKCW VEPDLEKGPH

SARS CoV KHFSMMILSD DAVVCYNSNY AAQGLVASIK NFKAVLYYQN NVFMSEAKCW TETDLTKGPH 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5405 5415 5425 5435 5445 5455 

EMCR EFCSQHTMQI VDKDGTYYLP YPDPSRILSA GVFVDDWKT DAWLLXRYV SLAIDAYPLS 

229E EFCSQHTMQI VDEKGKYYLP YPDPSRIISA GVFVDDITKT DAVILLERYV SLAIDAYPLS

PEDV EFCSQHTMQI VDKEGTYYLP YPDPSRILSA GVFVDDWKT DAWLLERYV SLAIDAYPLS 

TGEV EFCSQHTLQI VGPDGDYYLP YPDPSRILSA GVFVDDIVKT DNVIMLERYV SLAIDAYPLT 

OV43 EFCSQHTMLV KMDGDDVYLP YPNPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV

BoCoV EFCSQHTMLV KMDGDDVYLP YPVPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV

MHV EFCSQHTMLV KMDGDEVYLP YPDPSRILGA GCFVDDLLKT DSVLLIERFV SLAIDAYPLV 

AIBV EFCSQHTMLV EVDGEPKYLP YPDPSRILGA CVFVDDVDKT EPVAVMERYI ALAIDAYPLV 

SARS CoV EFCSQHTMLV KQGDDYVYLP YPDPSRILGA GCFVDDIVKT DGTLMIERFV SLAIDAYPLT 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5465 5475 5485 5495 5505 5515 

EMCR KHPNSEYRKV FYVLLDWVKH LNKNLNEGVL ESFSVTLLDN QEDKFWCEDF YASMYENSTI 

229E KHPKPEYRKV FYALLDWVKH LNKTLNEGVL ESFSVTLLDE HESKFWDESF YASMYEKSTV 

PEDV KHENPEYKKV FYVLLDWVKH LYKTLNAGVL ESFSVTLLED STAKFWDESF YANMYEKSAV 

TGEV KHPKPAYQKV FYTLLDWVKH LQKNLNAGVL DSFSVTMLEE GQDKFWSEEF YASLYEKSTV 

OV43 YHENEEYQKV FRVYLAYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV 

BoCoV YHENEEYQKV FRVYLEYIKK LYNELGNQIL DSYSVILSTC DGQKFTDESF YKNMYLRSAV 

MHV YHENPEYQNV FRVYLEYIKK LYNDLGNQIL DSYSVILSTC DGQKFTDETF YKNMYLRSAV 

AIBV HHENEEYKKV FFVLLAYIRK LYQELSQNML MDYSFVMDID KGSKFWEQEF YENMYRAPTT 

SARS CoV KHPNQEYADV FHLYLQYIRK LHDELTGHML DMYSVMLTND NTSRYWEPEF YEAMYTPHTV 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
5525 5535 5545 5555 5565 5575 

EMCR LQAAGLCWC GSQTVLRCGD CLRKPMLCTK CAYDHVFGTD HKFILAITPY VCNASGCGVS 

229E LQAAGLCWC GSQTVLRCGD CLRRPMLCTK CAYDHVFGTD HKFILAITPY VCNTSGCNVN 

PEDV LQSAGLCWC GSQTVLRCGD CLRRPMLCTK CAYDHVIGTT HKFILAITPY VCCASDCGVN 

TGEV LQAAGMCWC GSQTVLRCGD CLRRPLLCTK CAYDHVMGTK HKFIMSITPY VCSFNGCNVN 

OV43 MQSVGACWC SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN 

BoCoV MQSVGACWC SSQTSLRCGS CIRKPLLCCK CCYDHVMATD HKYVLSVSPY VCNAPGCDVN 

MHV MQSVGACWC SSQTSLRCGS CIRKPLLCCK CAYDHVMSTD HKYVLSVSPY VCNSPGCDVtJ 

AIBV LQSCGVCWC NSQTILRCGN CIRKPFLCCK CCYDHVMHTD HKNVLSINPY ICSQLGCGEA 

SARS CoV LQAVGACVLC NSQTSLRCGA CIRRPFLCCK CCYDHVISTS HKLVLSVNPY VCNAPGCDVT 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5585 5595 5605 5615 5625 5635 

EMCR DVKKLYLGGL NYYCTNHKPQ LSFPLCSAGN IFGLYKNSAT GSLDVEVFNR LATSDWTDVR 

229E DVTKLYLGGL NYYCVDHKPH LSFPLCSAGN VFGLYKSSAL GSMDIDVFNK LSTSDWSDIR 

PEDV DVTKLYLGGL SYWCHEHKPR LAFPLCSAGN VFGLYKNSAT GSPDVEDFNR IATSDWTDVS

TGEV DVTKLFLGGL SYYCMNHKPQ LSFPLCANGN VFGLYKSSAV GSEAVEDFNK LAVSDWTNVE 

OV43 DVTKLYLGGM SYYCEDHKPQ YSFKLVMNGL VFGLYKQSCT GSPYIDDFNR IASCKWTDVD

BOCOV DVTKLYLGGM SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIDDFNR IASCKWTDVD 

MHV DVTKLYLGGM SYYCEDHKPQ YSFKLVMNGM VFGLYKQSCT GSPYIEDFNK IASCKWTEVD 

AIBV DVTKLYLGGM SYFCGNHKPK LSIPLVSNGT VFGIYRANCA GSENVDDFNQ LATTNWSIVE 

SARS CoV DVTQLYLGGM SYYCKSHKPP ISFPLCANGQ VFGLYKNTCV GSDNVTDFNA IATCDWTNAG 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5645 5655 5665 5675 5685 5695 

EMCR DYKLANDVKD TLRLFAAETI KAKEESVKSS YAFATLKEW GPKELLLSWE SGKVKPPLNR 

229E DYKLANDAKE SLRLFAAETV KAKEESVKSS YAYATLKEIV GPKELLLLWE SGKAKPPLNR 

PEDV DYRLANDVKD SLRLFAAETI KAKEESVKSS YACATLHEVV GPKELLLKWE VGRPKPPLNR 

TGEV DYKLANNVKE SLKIFAAETV KAKEESVKSE YAYAVLKEVI GPKEIVLQWE ASKTKPPLNR 

OV43 DYILANECTE RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK 

BoCoV DYILANECTE RLKLFAAETQ KATEEAFKQS YASATIQEIV SERELILSWE IGKVKPPLNK 

MHV DYVLANECTE RLKLFAAETQ KATEESFKQC YASATIREIV SDRELILSWE IGKVRPPLNK 

AIBV PYILANRCSD SLRRFAAETV KATEELHKQQ FASAEVREVF SDRELILSWE PGKTRPPLNR 

SARS CoV DYILANTCTE RLKLFAAETL KATEETFKLS YGIATVREVL SDRELHLSWE VGKPRPPLNR 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
5705 5715 5725 5735 5745 5755 

EMCR NSVFTCFQIS KDSKFQIGEF IFEKVEYGSD TVTYKSTVTT KLVPGMIFVL TSHNVQPLRA 

229E NSVFTCFQIT KDSKFQVGEF VFEKVDYGSD TVTYKSTATT KLVPGMLFIL TSHNVAPLRA 

PEDV NSVFTCYHIT KNTKFQIGEF VFEKAEYDND AVTYKTTATT KLVPGMVFVL TSHNVQPLRA

TGEV NSVFTCFQIS KDTKIQLGEF VFEQSEYGSD SVYYKSTSTY KLTPGMIFVL TSHNVSPLKA 

OV43 NYVFTGYHFT KNGKTVLGEY VFDKSELT-N GVYYRATTTY KLSVGDVFVL TSHSVANLSA 

BoCoV NYVFTGYHFT KNGKTVLGEY VFDKSELT-N GVYYRATTTY KLSVGDVFVL TSHSVANLSA

MHV NYVFTGYHFT SNGKTVLGEY VFDKSELT-N GVYYRATTTY KLSVGDVFIL TSHAVSSLSA 

AIBV NYVFTGYHFT RTSKVQLGDF TFEKGEGK-D WYYKATSTA KLSVGDIFVL TSHNVVSLVA 

SARS COV NYVFTGYRVT KNSKVQIGEY TFEKGDYG-D AWYRGTTTY KLNVGDYFVL TSHTVMPLSA 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5765 5775 5785 5795 5805 5815 

EMCR PTIANQEKYS SIYKLHPAFN VSDAYANLVP YYQLIGKQKI TTIQGPPGSG KSHCSIGLGL 

229E PTMANQEKYS TIYKLHPSFN VSDAYANLVP YYQLIGKQRI TTIQGPPGSG KSHCSIGIGV 

PEDV PTIANQERYS TIHKLHPAFN IPEAYSSLVP YYQLIGKQKI TTIQGPPGSG KSHCVIGLGL 

TGEV PILVNQEKYN TISKLYPVFN IAEAYNTLVP YYQMIGKQKF TTIQGPPGSG KSHCVIGLGL 

OV43 PTLVPQENYS SIR-FASVYS VLETFQNNW NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV 

BOCOV PTLVPQENYS SIR-FASVYS VLETFQNNW NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV 

MHV PTLVPQENYT SIR-FASVYS VPETFQNNVP NYQHIGMKRY CTVQGPPGTG KSHLAIGLAV 

AIBV PTLCPQQTFS RFVNLRPNVM VPECFVNNIP LYHLVGKQKR TTVQGPPGSG KSHFAIGLAV 

SARS CoV PTLVPQEHYV RITGLYPTLN ISDEFSSNVA NYQKVGMQKY STLQGPPGTG KSHFAIGLAL 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
5825 5835 5845 5855 5865 5875 

EMCR YYPGARIVFV ACAHAAVDSL CAKAMTVYSI DKCTRIIPAR ARVECYSGFK PNNTSAQYIF 

229E YYPGARIVFT ACSHAAVDSL CAKAVTAYSV DKCTRIIPAR ARVECYSGFK PNNNSAQYVF 

PEDV YYPGARIVFT ACSHAAVDSL CVKASTAYSN DKCSRIIPQR ARVECYDGFK SNNTSAQYLF 

TGEV YYPQARIVYT ACSHAAVDAL CEKAAKNFNV DRCSRIIPQR IRVDCYTGFK PNNTNAQYLF 

OV43 FYCTARVVYT AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF 

BOCOV YYCTARWYT AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVECYDKFK INDTTRKYVF 

MHV YYCTARWYT AASHAAVDAL CEKAYKFLNI NDCTRIVPAK VRVDCYDKFK VNDTTRKYVF 

AIBV YFSSARWFT ACSHAAVDAL CEKAFKFLKV DDCTRIVPQR TTVDCFSKFK ANDTGKKYIF 

SARS COV YYPSARIVYT ACSHAAVDAL CEKALKYLPI DKCSRIIPAR ARVECFDKFK VNSTLEQYVF 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
5885 5895 5905 5915 5925 5935 

EMCR STVNALPECN ADIVWDEVS MCTNYDLSVI NQRLSYKHIV YVGDPQQLPA PRVMITKGVM 

229E STVNALPEVN ADIVWDEVS MCTNYDLSVI NQRISYKHIV YVGDPQQLPA PRVLISKGVM 

PEDV STVNALPECN ADIWVDEVS MCTNYDLSVI NQRISYRHW YVGDPQQLPA PRVMISRGTL 

TGEV CTVNALPEAS CDIWVDEVS MCTNYDLSVI NSRLSYKHIV YVGDPQQLPA PRTLINKGVL 

OV43 TTINALPEMV TDIVWDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL 

BOCOV TTINALPEMV TDIWVDEVS MLTNYELSVI NARIRAKHYV YIGDPAQLPA PRVLLSKGTL 

MHV TTINALPELV TDIIVVDEVS MLTNYELSVI NSRVRAKHYV YIGDPAQLPA PRVLLNKGTL 

AIBV STINALPEVS CDILLVDEVS MLTNYELSFI NGKINYQYW YVGDPAQLPA PRTLLN-GSL 

SARS CoV CTVNALPETT ADIWFDEIS MATNYDLSW NARLRAKHYV YIGDPAQLPA PRTLLTKGTL 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

5945 5955 5965 5975 5985 5995 

EMCR EPVDYNWTQ RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKPA SKQCFKIFFK 

229E EPIDYNVVTQ RMCAIGPDVF LHKCYRCPAE IVNTVSELVY ENKFVPVKEA SKQCFKIFER 

PEDV EPKDYNWTQ RMCALKPDVF LHKCYRCPAE IVRTVSEMVY ENQFIPVHPD SKQCFKIFCK 

TGEV QPQDYNVVTK RMCTLGPDVF LHKCYRCPAE IVKTVSALVY ENKFVPVNPE SKQCFKMFVK 

OV43 EPKYFNTVTK LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK 

BOCOV EPKYFNTVTK LMCCLGPDIF LGTCYRCPKE IVDTVSALVY ENKLKAKNES SSLCFKVYYK 

MHV EPRYFNSVTK LMCCLGPDIF LGTCYRCPKE IVDTVSALVY HNKLKAKNDN SSMCFKVYYK 

AIBV SPKDYNWTN LMVCVKPDIF LAKCYRCPKE IVDTVSTLVY DGKFIANNPE SRECFKVIVN 

SARS CoV EPEYFNSVCR LMKTIGPDMF LGTCRRCPAE IVDTVSALVY DNKLKAHKDK SAQCFKMFYK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
6005 6015 6025 6035 6045 6055 

EMCR GNVQVDN GSSINRKQLE IVKLFLVKNP SWSKAVFISP YNSQNYVASR FLGLQIQTVD 

229E GSVQVDN GSSINRRQLD WKRFIHKNS TWSKAVFISP YNSQNYVAAR LLGLQTQTVD 

PEDV GNVQVDN GSSINRRQLD VVRMFLAKNP RWSKAVFISP YNSQNYVASR LLGLQIQTVD 

TGEV GQVQIES NSSINNKQLE VVKAFLAHNP KWRKAVFISP YNSQNYVARR LLGLQTQTVD 

OV43 - — GVTTHES SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD

BOCOV GVTTHES SSAVNMQQIY LINKFLKANP LWHKAVFISP YNSQNFAAKR VLGLQTQTVD 

MHV GQTTHES SSAVNMQQIY LISKFLKANP SWSNAVFISP YNSQNYVAKR VLGLQTQTVD 

AIBV NGNSDVGHES GSAYNTTQLE FVKDFVCRNK QWREAIFISP YNAMNQRAYR MLGLNVQTVD 

SARS CoV —GVITHDV SSAINRPQIG WREFLTRNP AWRKAVFISP YNSQNAVASK ILGLPTQTVD 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6065 6075 6085 6095 6105 6115 

EMCR SSQGSEYDYV IYAQTSDTAH ACNVNRFNVA ITRAKKGIFC VMCDKT-LFD SLKFFEIKHA 

229E SAQGSEYDYV IFAQTSDTAH ACNANRFNVA ITRAKKGIFC IMSDRT-LFD ALKFFEITMT 

PEDV SSQGSEYDYV IYAQTSDTAH ASNVNRFNVA ITRAKKGILC IMCDRS-LFD LLKFFELKLS 

TGEV SAQGSEYDYV IYTQTSDTQH ATNVNRFNVA ITRAKVGILC IMCDRT-MYE NLDFYELKDS 

OV43 SAQGSEYDYV IYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTLD 

BOCOV SAQGSEYDYV IYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSNMQ-LFE ALQFTTLTVD

MHV SAQGSEYDFV IYSQTAETAH SVNVNRFNVA ITRAKKGILC VMSSMQ-LFE SLNFSTLTLD 

AIBV SSQGSEYDYV IFCVTADSQH ALNINRFNVA LTRAKRGILV VMRQRDELYS ALKFTELDSE

SARS CoV SSQGSEYDYV IFTQTTETAH SCNVNRFNVA ITRAKIGILC IMSDRD-LYD KLQFTSLEIP 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6125 6135 6145 6155 6165 6175 

EMCR DLHSS -QVCGLFKNC TRTPLNLPPT HAHTFLSLSD QFKTTGDLAV QIGS-N-NVC 

229E DLQSE -SSCGLFKDC ARNPIDLPPS HATTYLSLSD RFKTSGDLAV QIGN-N-NVC 

PEDV DLQAN -EGCGLFKDC SRGDDLLPPS HANTFMSLAD NFKTDQYLAV QIGV-N-GPI 

TGEV KI GLQAK PETCGLFKDC SKSEQYIPPA YATTYMSLSD NFKTSDGLAV NIG — T-KDV 

OV43 KVPQAVETKV QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV 

BOCOV KVPQAVETRV QCSTNLFKDC SKSYSGYHPA HAPSFLAVDD KYKATGDLAV CLGIGD-SAV 

MHV KIN NPRL QCTTNLFKDC SRSYAGYHPA HAPSFLAVDD KYKVGGDLAV CLNVAD-SAV

AIBV T S— LQGTGLFKIC NKEFSGVHPA YAVTTKALAA TYKVNDELAA LVNVEAGSEI 

SARS CoV RRN-VATLQA ENVTGLFKDC SKIITGLHPT QAPTHLSVDI KFKTEG-LCV DIPGIP-KDM 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6185 6195 6205 6215 6225 6235 

EMCR TYEHVISFMG FRFDISIPGS HSLFCTRDFA IRNVRGWLGM DVESAHVCGD NIGTNVPLQV 

229E TYEHVISYMG FRFDVSMPGS HSLFCTRDFA MRHVRGWLGM DVEGAHVTGD NVGTNVPLQV

PEDV KYEHVISFMG FRFDINIPNH HTLFCTRDFA MRNVRGWLGF DVEGAHVVGS NVGTNVPLQL 

TGEV KYANVISYMG FRFEANIPGY HTLFCTRDFA MRNVRAWLGF DVEGAHVCGD NVGTNVPLQL

OV43 TYSRLISLMG FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL 

BOCOV TYSRLISLMG FKLDVTLDGY CKLFITKEEA VKRVRAWVGF DAEGAHATRD SIGTNFPLQL 

MHV TYSRLISLMG FKLDLTLDGY CKLFITRDEA IRRVRAWVGF DAEGAHATRD SIGTNFPLQL 

AIBV TYKHLISLLG FKMSVNVEGC KNMFITRDEA IRNVRGWVGF DVEATHACGT NIGTNLPFQV 

SARS CoV TYRRLISMMG FKMNYQVNGY PNMFITREEA IRHVRAWIGF DVEGCHATRD AVGTNLPLQL 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6245 6255 6265 6275 6285 6295 

EMCR GF SNGVNFW QTEGCVSTNF GDVIKPVCAK SPPGEQFRHL VPFLRKGQPW LIVRRRIVQM 

229E GFSNGVDFVA QPEGCVLTNT GSWKPVRAR APPGEQFTHI VPLLRKGQPW SVLRKRIVQM 

PEDV GFSNGVDFW RPEGCWTES GDYIKPVRAR APPGSQFAHL LPLLKRGQPW DWRKRIVQM 

TGEV GFSNGVDFW QTEGCVITEK GNSIEWKAR APPGEQFAHL IPLMRKGQPW HIVRRRIVQM 

OV43 GFSTGIDFW EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGHRW DWRPRIVQM 

BoCoV GFSTGIDFW EATGLFADRD GYSFKKAVAK APPGEQFKHL IPLMTRGQRW DWRPRIVQM 

MHV GFSTGIDFW EATGMFAERD GYVFKKAVAR APPGEQFKHL VPLMSRGQKW DWRIRIVQM 

AIBV GFSTGADFW TPEGLVDTSI GNNFEPVNSK APPGEQFNHL RVLFKSAKPW HVIRPRIVQM 

SARS COV GFSTGVNLVA VPTGYVDTEN NTSFTRVNAK PPPGDQFKHL IPLMYKGLPW NWRIKIVQM 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6305 6315 6325 6335 6345 6355 

EMCR ISDYLSNLSD ILVFVLWAGS LELTTMRYFV KIGP-IKYCY CGNSATCYNS VSHEYCCFKH 

229E IADFLAGSSD VLVFVLWAGG LELTTMRYFV KIGA-VKHCQ CGTVATCYNS VSNDYCCFKH 

PEDV CSDYLANLSD ILIFVLWAGG LELTTMRYFV KIGP-SKSCD CGKVATCYNS ALHTYCCFKH 

TGEV VCDYFDGLSD ILIFVLWAGG LELTTMRYFV KIGR-PQKCE CGKSATCYSS SQSVYACFKH 

OV43 FADHLIDLSD CVVLVTWAAN FELTCLRYFA KVGREISCNV CTKRATVYNS RTGYYGCWRH 

BoCoV FADHLIDLSD CVVLVTWAAN FELTCLRYFA KVGREISCNV STKRATAYNS RTGYYGCWRH 

MHV LSDHLVDLAD SVVLVTWAAS FELTCLRYFA KVGKEVVCSV CNKRATCFNS RTGYYGCWRH 

AIBV LADNLCNVSD CWFVTWCHG LELTTLRYFV KIGK-EQVCS CGSRATTFUS HTQAYACWKH 

SARS CoV LSDTLKGLSD RVVFVLWAHG FELTSMKYFV KIGPERTCCL CDKRATCFST SSDTYACWNH 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6365 6375 6385 6395 6405 6415 

EMCR ALGCDYVYNP YAFDIQQWGY VGSLSQNHHT FCNIHRNEHD ASGDAVMTRC LAVHDCFVKN 

229E ALGCDYVYNP YVIDIQQWGY VGSLSTNHHA ICNVHRNEHV ASGDAIMTRC LAVYDCFVKN

PEDV ALGCDYLYNP YCIDIQQWGY KGSLSLNHHE HCNVHRNEHV ASGDAIMTRC LAIHDCFVKN 

TGEV ALGCDYLYNP YCIDIQQWGY TGSLSMNHHE VCNIHRNEHV ASGDAIMTRC LAIHDCFVKR 

OV43 SVTCDYLYNP LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN 

BOCOV SVTCDYLYNP LIVDIQQWGY IGSLSSNHDL YCSVHKGAHV ASSDAIMTRC LAVYDCFCNN 

MHV SYSCDYLYNP LIVDIQQWGY TGSLTSNHDL ICSVHKGAHV ASSDAIMTRC LAVHDCFCKS 

AIBV CLGFDFVYNP LLVDIQQWGY SGNLQFNHDL HCNVHGHAHV ASVDAIMTRC LAINNAFCQD 

SARS CoV SVGFDYVYNP FMIDVQQWGF TGNLQSNHDQ HCQVHGNAHV ASCDAIMTRC LAVHECFVKR

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6425 6435 6445 6455 6465 6475 

EMCR VDWTVTYPFI ANEKFINGCG RNVQGHVVRA ALKLYKPSVI HDIGNPKGVR CA-VTDAKWY 

229E VDWSITYPMI ANENAINKGG RTVQSHIMRA AIKLYNPKAI HDIGNPKGIR CA-VTDAKWY 

PEDV VDWSITYPFI GNEAVINKSG RIVQSHTMRS VLKLYNPKAI YDIGNPKGIR CA-VTDAKWF 

TGEV VDWSIVYPFI DNEEKINKAG RIVQSHVMKA ALKIFNPAAI HDVGNPKGIR CA-TTPIPWF 

OV43 INWNVEYPII SNELSINTSC RVLQRVILKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK 

BoCoV INWNVEYPII SNELSINTSC RVLQRVMLKA AMLCNRYTLC YDIGNPKAIA CV — KDFDFK 

MHV VNWSLEYPII SNEVSVNTSC RLLQRVMFRA AMLCNRYDVC YDIGNPKGLA CV — KGYDFK 

AIBV VNWDLTYPHI ANEDEVNSSC RYLQRMYLNA CVDALKVNW YDIGNPKGIK CVRRGDVNFR 

SARS COV VDWSVEYPII GDELRVNSAC RKVQHMVVKS ALLADKFPVL HDIGNPKAIK CVPQAEVEWK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6485 6495 6505 6515 6525 6535 

EMCR CYDKQPVNSN VKLLDYD YATHG — QLD GLCLFWNCNV DMYPEFSIVC RFDTRTRSVF 

229E CYDKNPINSN VKTLEYD YMTHG — QMD GLCLFWNCNV DMYPEFSIVC RFDTRTRSTL 

PEDV CFDKNPTNSN —VKTLEYD YITHG — QFD GLCLFWNCNV DMYPEFSWC RFDTRCRSPL 

TGEV CYDRDPINNN VRCLDYD YMVHG — QMN GLMLFWNCNV DMYPEFSIVC RFDTRTRSKL 

OV43 FYDAQPIVKS VKTLLYS FEAHKDSFKD GLCMFWNCNV DKYPPNAWC RFDTRVLNNL 

BOCOV FYDAQPIVKS VKTLLYF FEAHKDSFKD GLCMFWNCNV DKYPPNAWC RFDTRVLNNL 

MHV FYDASPVVKS VKQFVYK YEAHKDQFLD GLCMFWNCNV DKYPANAWC RFDTRVLNKL 

AIBV FYDKNPIVRN VKQFEYD YNQHKDKFAD GLCMFWNCNV DCYPDNSLVC RYDTRNLSVF 

SARS CoV FYDAQPCSDK AYKIEELFYS YATHHDKFTD GVCLFWNCNV DRYPANAIVC RFDTRVLSNL 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

6545 6555 6565 6575 6585 6595 

EMCR NLEGVNGGSL YVNKHAFHTP AYDKRAFVKL KPMPFFYFDD SDCDWQ EQVNYVPLR 

229E NLEGVNGGSL YVNNHAFHTP AYDKRAMAKL KPAPFFYYDD GSCEVVH DQVNYVPLR 

PEDV NLEGCNGGSL YVNNHAFHTP AFDKRAFAKL KPMPFFFYDD TECDKLQ DSINYVPLR 

TGEV SLEGCNGGAL YVNNHAFHTP AYDRRAFAKL KPMPFFYYDD SNCELVD GQPNYVPLK 

OV43 NLPGCNGGSL YVNKHAFHTK PFARAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK 

BoCoV NLPGCNGGSL YVNKHAFHTK PFSRAAFEHL KPMPFFYYSD TPCVYMDGMD AKQVDYVPLK

MHV NLPGCNGGSL YVNKHAFHTS PFTRAAFENL KPMPFFYYSD TPCVYMEGME SKQVDYVPLR 

AIBV NLPGCNGGSL YVNKHAFYTP KFDRISFRNL KAMPFFFYDS SPCETIQVDG -VAQDLVSLA 

SARS CoV NLPGCDGGSL YVNKHAFHTP AFDKSAFTNL KQLPFFYYSD SPCESHGKQV VSDIDYVPLK 

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
6605 6615 6625 6635 6645 6655 

EMCR ASSCVTRCNI GGAVCSKHAN LYQKYVEAYN TFTQAGFNIW VPHSFDVYNL WQIFIET-NL 

229E ATNCITKCNI GGAVCSKHAN LYRAYVESYN IFTQAGFNIW VPTTFDCYNL WQTFTEV-NL

PEDV ASNCITKCNV GGAVCSKHCA MYHSYVNAYN TFTSAGFTIW VPTSFDTYNL WQTFSN — NL 

TGEV SNVCITKCNI GGAVCKKHAA LYRAYVEDYN IFMQAGFTIW CPQNFDTYML WHGFVNSKAL 

OV43 SATCITRCNL GGAVCLKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L 

BoCOV SATCITRCNL GGAVCLKHAE EYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTK L 

MHV SATCITRCNL GGAVCLKHAE DYREYLESYN TATTAGFTFW VYKTFDFYNL WNTFTR L 

AIBV TKDCITKCNI GGAVCKKHAQ MYAEFVTSYN AAVTAGFTFW VTNKLNPYNL WKSFSA L 

SARS CoV SATCITRCNL GGAVCRHHAN EYRQYLDAYN MMISAGFSLW IYKQFDTYNL WNTFTR— L 






WO 2005/049814 



75/87 



PCT/NL2004/000805 



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
6665 6675 6685 6695 6705 6715

EMCR QSLENIAFNV VKKGCFTGVD GELPVAWND KVFVRYGDVD NLVFTNKTTL PTNVAFELFA

229E QGLENIAFNV VNKGSFVGAD GELPVAISGD KVFVRDGNTD NLVFVNKTSL PTNIAFELFA

PEDV QGLENIAFNV LKKGSFVGDE GELPVAWND KVLVRDGTVD TLVFTNKTSL PTNVAFELYA

TGEV QSLENVAFNV VKKGAFTGLK GDLPTAVIAD KIMVRDGPTD KCIFTNKTSL PTNVAFELYA

OV43 QSLENWYNL VKTGHYTGQA GEMPCAIIND KWAKIDKED WIFINNTTY PTNVAVELFA

BOCOV QSLENWYNL VKTGHYTGQA GEMPCAIIND KWAKIDKED WIFINNTTY PTNVAVELFA

MHV QSLENWYNL VNAGHFDGRA GELPCAVIGE KVIAKIQNED WVFKNNTPF PTNVAVELFA

AIBV QSIONIAYNM YKGGHYDAIA GEMPTVITGD KVFVIDQGVE KAVFVNQTTL PTSVAFELYA

SARS CoV QSLENVAYNV VNKGHFDGHA GEAPVSIINN AVYTKVDGID VEIFENKTTL PVNVAFELWA



....|....| ....|....| ....|....| ....|....|
6725 6735 6745 6755

EMCR KRKMGLTPPL SILKNLGWA TYKFVLWDYE AERPFTSYTK

229E KRKVGLTPPL SILKNLGWA TYKFVLWDYE AERPLTSFTK

PEDV KRKVGLTPPI TILRNLGWC TSKCVIWDYE AERPLTTFTK

TGEV KRKLGLTPPL TILRNLGWA TYKFVLWDYE

OV43 KRSVRHHPEL KLFRNLNIDV CWKHVIWDYA

BoCoV KRSIRHHPEL KLFRNLNIDV CWKHVIWDYA

MHV KRSIRPHPEL KLFRNLNIDV CWSHVLWDYA

AIBV KRNIRTLPNN RILKGLGVDV TNGFVIWDYA

SARS CoV KRNIKPVPEI KILNNLGVDI AANTVIWDYK
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BoCoV EIPQFGTGVL IACLIWLNSR LSWLVMP 

MHV PLKVAGTAVV SLKPDQINDL VLSLIEKGKL LVRDTRKEVF VGDSLVNVK- 

AIBV DLRLKATPVV NLKTEQKTDL VFNLIKCGKL LVRDVGNTSF TSDSFVCTM- 

SARS COV PLKLRGTAVM SLKENQINDM IYSLLEKGRL IIRENNRVW SSDILVNN— 
PTTSGVSSPQ YWVTPLVKRQ YLFNFNQKGI ITSAVDCASS YTSEIKCKTQ SMNPNTG-VY

PHEV S ALSLE YWVTPLTTRQ FLLAFDQDGV LYHAVDCASD FMSEIMCKTS SITPPTG-VY

AIBV G SSS GCTVGIIHGG RVVNASSIAM TAPSSGMAWS SSQFCTAHCN FSDTTVFVTH

SARS QDIWGTSAAA YFVGYLKPTT FMLKYDENGT ITDAVDCSQN PLAELKCSVK SFEIDKG-IY

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
EMCR S NCNGYQHNSV ADVMRYNLNL SANSVDNLKS GVIVFKTLQY -DVLFYCSN- --SS-SGVLD

229E S DCKGFSSDVL SDVIRYNLNF EEN LRR GTILFKTSYG -VWFYCTN- --NT-LVSGD

PEDV VCNGAAVDRA PEALRFNIND TSV ILAE GSIVLHTALG TNLSFVCSN- --SSDPHLAI

TGEV QCNGAVLNNT VDVIRFNLNF TTNVQSGKGA TVFSLNTTGG VTLEISCY-- TVSDSSFFSY

CaCoV QCNGVSLNNT VDVIRFNLNF TTDVQSGMGA TVFSLNTTGG VILEISCYND TVSESSFYSY

FeCoV QCNGVSLNNT VDVIRFNLNF TADVQSGMGA TVFSLNTTGG VILEISCYSD TVSESSSYSY

Por Resp C QCNGAVLNNT VDVIRFNLNF TTNVQSGKGA TVFSLNTTGG VTLEISCYND TVSDSSFSSY

OC43 ELNGYTVQPI ADVYRRKLNL PNCNIEAWLN DKSVPSPLNW ERKTFSNCNF NMSSLMSFIQ

BOCOV ELNGYTVQPI ADVYRRIPNL PDCNIEAWLN DKSVPSPLNW ERKTFSNCNF NMSSLMSFIQ

MHV DLSGYTVQPV GLVYRRVRNL PDCKIEEWLT AKSVPSPLNW ERKTFQNCNF DLSSLLRFVQ

Rat CoV DLSGYTVQPV GLVYRRVRNL PDCKIEEWLA ANTVPSPLNW ERKTFQNCNF NLSSLLRFVQ

PHEV ELNGYTVQPV ATVYRRIPDL PNCDIEAWLN SKTVSSPLNW ERKIFSNCNF NMGRLMSFIQ

AIBV CYKHGGCPLT GMLQQNLIRV SAMKNGQLFY NLTVSVAKYP TFRSFQCVN- --NLTSVYLN

SARS QTSNFRVVPS GDVVRFPNIT NLCPFGEVFN ATKFPSVYAW ERKKISNCVA DYSVLYNSTF

....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
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465 
EMCR S TTIPFGPSSQ PYYCFINSTI NTTHVSTFVG ILPPTVREIV VARTGQFYIN GFKYFDLGFI

229E S AHIPFGTVLG NFYCFVNTTI GNETTSAFVG ALPKTVREFV ISRTGHFYIN GYRYFTLGNV

PEDV FAIPLGATEV PYYCFLKVDT YNSTVYKFLA VLPPTVREIV ITKYGDVYVN GFGYLHLGLL

TGEV GEIPFGVTDG PRYCYV H YNGTALKYLG TLPPSVKEIA ISKWGHFYIN GYNFFSTFPI

CaCoV GEIPFGVTDG PRYCYV L YNGTALKYLG TLPPSVKEIA ISKWGHFYIN GYNFFSTFPI

FeCoV GEIPFGITDG PRYCYV L YNGTALKYLG TLPPSVKEIA ISKWGHFYIN GYNFFSTFPI

Por Resp C GEIPFGVTNG PRYCYV---L YNGTALKYLG TLPPSVKEIA ISKWGHFYIN GYNFFSTFPI

OC43 ADSFTCNNID AAKIYG--MC FSSITIDKFA IPNGRKVDLQ LGNLGYLQSF NYRIDTTATS

BOCOV ADSFTCNNID AAKIYG--MC FSSITIDKFA IPNGRKVDLQ LGNLGYLQSF NYRIDTTATS

MHV AESLSCSNID ASKVYG--MC FGSISIDKFA IPNRRRVDLQ LGNSGFLQSF NYKIDTRATS

Rat CoV AESLSCSNIO ASKVYG--MC FGSISIDKFA IPNSRRVDLQ LGKSGLLQSF NYKIDTRATS

PHEV ADSFGCNNID ASRLYG--MC FGSITIDKFA IPNSRKVDLQ VGKSGYLQSF NYKIDTAVSS

AIBV GDLVYTSNET IDVTSAG--V YFKAGGPITY KVMREVKALA YFVNGTAQDV ILCDGSPRGL

SARS FSTFKCYGVS ATKLND--LC FSNVYADSFV VKGDDVRQIA PGQTGVIADY NYKLPDDFMG
EMCR S EAVNFNVT-- --TASATDFW TVAFATFVDV LVNVSATNIQ NLLYCDSPFE KLQCEHLQFG

229E S EAVNFNVT-- --TAETTDFC TVALASYADV LVNVSQTSIA NIIYCNSVIN RLRCDQLSFD

PEDV DAVTINFTGH GTDDDVSGFW TIASTNFVDA LIEVQGTSIQ RILYCDDPVS QLKCSQVAFD

TGEV DCISFNLT-- --TGDSDVFW TIAYTSYTEA LVQVENTAIT KVTYCNSHVN NIKCSQITAN

CaCoV DCIAFNLT-- --TGASGAFW TIAYTSYTEA LVQVENTAIK KVTYCNSHIN NIKCSQLTAN

FeCoV GCISFNLT-- --TGVSGAFW TIAYTSYTEA LVQVENTAIK NVTYCNSHIN NIKCSQLTAN

Por Resp C DCISFNLT-- --TGDSDVFW TIAYTSYTEA LVQVENTAIT NVTYCNSYVN NIKCSQLTAN

OC43 CQLYYNLP-- AANVSVS RFNPSTWNKR FGFIEDSVFK PRPAGVLTNH DWYAQHCFK

BOCOV CQLYYNLP-- AANVSVS RFNPSTWNRR FGFTEQFVFK PQPVGVFTHH DWYAQHCFK

MHV CQLYYSLA-- KNNVTVN NHNPSSWNRR YGFND -VATFGTGKH DVAYAEACFT

Rat Cov CQLYYSLA-- QDNVTVI NHNPSSWNRR YGFND -VATFHSGEH DVAYAEACFT

PHEV CQLYYSLP-- AANVSVT HYNPSSWNRR YGFNN -QSFGSRGLH DAVYSQQCFN

AIBV LACQYNTG-- NFSDGFY PFTNSSLVKQ TCTLHNFIFH

SARS CVLAWNTR-- NID ATSTGNYNYK
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545 
LQDGFY--SA 
VPDGFY--ST 
LDDGFYPISS 
LNNGFYPVSS 
LQNGFYPVAS 
LNNGFYPVAS 
LNNGFYPVSS 
APKNFCPCKL 
APKNFCPCKL 
VGASYCPCAN 
VGASYCPCAK 
TPNTYCPCRT 
NETGANPNPS 
KLRPFER 
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555 
— NFLDDNVL 
— SPIQSVEL 
— RNLLSHEQ 
— SEVG--LV 
— SEVG--LV 
--SEVG--FV 
— SEVG--SV 
NGS-CVGSGP 
DGSLCVGNGP 
P-SIVSPCTT 
P-STVYSCVT 
— SQCIG-— 
565



-ET 
-VS 
-IS 

-KS 



N KS 

N KS 

N KS 

G KNNG 

GIDAGYKNSG 

G K-PN 

G K-PK 

G AG 

G 

D 
575
YVALPIYYQH 
IVSLPVYHKH 
FVTLPSFNDH 
WLLPSFYTH 
WLLPSFYSH 
WLLPSFFTY 
WLLPSFLTH 
IGTCPAGTNY 
IGTCPAGTNY 
FANCPTGTSN 
SANCPTGTSN 
TGTCPVGTTV 
VQNIQTYQTK 
ISNVPFSPDG 
585
TDINFTATA- 
TFIVLYVDFK 
SFVNITVSA- 
TIVNITIGLG 
TSVNITIDLG 
TAVNITIDLG 
TIVNITIGLG 

LTCDN 

LTCHNAA 

RECTVMPLAN 
RECNVQASG- 
RKCFAAVTK- 
TAQSGYYNFN 
KPCTP 
595
--SFGGSCYV 
PQSGGGKCFN 
--AFGG-LSS 
-MKRSGYGQP 
-MKRS-VTVT 
-MKLSGYGQP 
-MKRSGYGQP 

LC 

QCDCLC 

-NQFKCDCTC 
-FKSKCDCTC 
--ATKCTCWC 
FSF 
605
CKPRQVNISL 
CYPAGVNITL 
ANLVASDTTI 
IASTLSNITL 
IASPLSNITL 
IASTLSNITL 
IASTLSNITL 
TPDPITFKAT 
TPDPITSKST 
NPSPLTTYDL 
NPSPLTTYDP 
QPDPSTYKGV 
LSSFVYKESN 
615 625 635 645 655

N GNTSV CVRTSHFSIR YIYNRVKSGS P G DSSWHIYLKS 

ANFNETKGPL CVDTSHFTTK YVAVYANVG RWSASINT 

N GFSSF CVDTRQFTIT LFYNVTNSYG YVSKSQD 

PMQDHNTDVY CIRSDQFSVY VHSTCKSALW DNIFKRNCTD VLDATAVIKT 

PMQDNNIDVY CIRSNQFSVY VHSTCKSSLW DNNFNSACTD VLDATAVIKT 

PMQDNNTDVY CIRSNQFSVY VHSTCKSSLW DNIFNQDCTD VLEATAVIKT 

PMQDNNNDVY CVRSDQFSVY VHSTCKSVLW DNVFKRNCTD VLDATAVIKT 

GTYKCPQTKS LVGIGEHCSG LAVKSDYCGG N SCTCRPQAFL 

GPYKCPQTKY LVGIGEHCSG LAIKSDYCGG N PCTCQPQAFL 

— R-CLQARS MLGVGDHCEG LGVLEDKCGG S N TCNCSAHAFV 

— R-CLQARS MLGVGDHCEG LGILEDKCGG S N ICNCSADAFV 

NAWTCPQSKV SIQPGQHCPG LGLVEDDCSG N PCTCKPQAFI 

FMYGSYHPSC KFRLETINNG LWFNSLSVS IAYGPLQ 

p — AL NCYWPLNDY G FYTTTGI 
665 675 685 695 705 715

EMCR S GTCPFSFSKL NNFQKFKTIC FSTVEVPGSC NFPLEATW — HYTSYTIVGA LYVTWSEGNS 

229E S GNCPFSFGKV NNFVKFGSVC FSLKDIPGGC AMPIVANW— AYSKYYTIGS LYVSWSDGDG 

PEDV SNCPFTLQSV NDYLSFSKFC VSTSLLAGAC TIDLFGYP— AFGSGVKLTS LYFQFTKGEL 

TGEV GTCPFSFDKL NNYLTFNKFC LSLSPVGANC KFDVAAR TRTNEQVVRS LYVIYEEGDN 

CaCoV GTCPFSFDKL NNYLTFNKFC LSLNPVGANC KLDVAAR — - TRTNEQVFGS LYVIYEEGDN 

FeCoV GTCPFSFDKL NNYLTFNKFC LSLSPVGANC KFDVAAR— TRTNEQVVRS LYVIYEEGDN 

Por Resp C GTCPFSFDKL NNYLTFNKFC LSLSPVGANC KFDVAAR— TRTNDQVVRS LYVIYEEGDS 

OC43 GWSADSCLQG DKCNIFANFI LHDVNSGLTC STDLQKANTD IILGVCVNYD LYGILGQGIF 

BoCoV GWSVDSCLQG DRCNIFANFI FHDVNSGTTC STDLQKSNTD IILGVCVNYD LYGITGQGIF 

MHV GWAKDSCLAN GRCHIFSNLM LNGINSGTTC SMDLQLPNTE WTGVCVKYD LYGITGQGIF 

Rat COV GWAMDSCLSN ARCHIFSNLM LNGINSGTTC STDFQLPNTE WTGVCVKYD LYGSTGQGVF 

PHEV GWSSETCLQN GRCNIFANFI LNDVNSGTTC STDLQQGNTI ITTDVCVNYD LYGITGQGIL 

AIBV GGCKQSVFKG RATCCYAYSY GGPSLCKGVY SGELDHN FECGLLV YVTKSGGSRI 

SARS GYQPYRVVVL S FELLN APATVCGPKL STDLIKN QCVNFN FNGLTGTG-V 
725 735 745 755 765 775

EMCR S ITGVPYPVSG IREFSNLVLN NCTKYNIYDY VGTGIIRSSN QSLAGGITYV S 

229E S ITGVPQPVEG VSSFMNVTLD KCTKYNIYDV SGVGVIRVSN DTFLNGITYT S

PEDV ITGTPKPLEG ITDVSFMTLD VCTKYTIYGF KGEGIITLTN SSILAGVYYT S 

TGEV IVGVPSDNSG VHDLSVLHLD SCTDYNIYGR TGVGIIRQTN RTLLSGLYYT S

CaCoV IVGVPSDNSG LHDLSVLHLD SCTDYNIYGR TGVGIIRKTN STLLSGLYYT S

FeCoV IVGVPSDNSG LHDLSVLHLD SCTDYNIYGR TGVGIIRRTN STLLSGLYYT S

Por Resp C IVGVPSDNSG LHDLSVLHLD SCTDYNIYGR TGVGIIRQTN RTILSGLYYT S

OC43 VEVNATYYNS WQNLLYDSNG NLYGFRDYIT NRTFMIRSCY SGRVSAAFHA N 

BoCoV VEVNATYYNS WQNLLYDSNG NLYGFRDYLT NRTFMIRSCY SGRVSAAFHA N 

MHV KEVKADYYHS WQNLLYDVNG NLIGFRDFVA NKSYTIRSCY SGRVSAAYHQ D 

Rat COV KEVKADYYNS WQNLLYDVNG NLNGFRDIVT NKTYLLRSCY SGRVSAAYHQ D 

PHEV IEVNATYYNS WQNLLYDSSG NLYGFRDYLS NRTFLIRSCY SGRVSAVFHA N 

AIBV QTATEPPVIT QNNYNNITLN TCVDYNIYGR TGQGFITNVT DSAVSYNYLA DAGLAILDTS 

SARS LTPSSKRFQP FQQFGRDVSD FTDSVRDPKT SEILDISPCS FGGVSVITPG TN A 

. ... I .... I ....I.... I ....|.... | ....|....| ....|.... | ....|....| 
785 795 805 815 825 835 

EMCR S NSGNLLGFKN VSTGNIFIVT PCNQPDQVAV YQQ-SIIGAM TAVNESRYGL QNLLQLPNFY 

229E S TSGNLLGFKD VTKGTIYSIT PCNPPDQLW YQQ-AWGAM LSENFTSYGF SNWELPKFF

PEDV DSGQLLAFKN VTSGAVYSVT PCSFSEQAAY VND-DIVGVI SSLSNS — TF NNTRELPGFF 

TGEV LSGDLLGFKN VSDGVIYSVT PCDVSAQAAV IDG-TIVGAI TSINSELLGL THWTTTPNFY 

CaCoV LSGDLLGFKN VSDGWYSVT PCDVSAQAAV IDG-AIVGAM TSINSELLGL THWTTTPNFY 

FeCoV LSGDLLGFKN VSDGVIYSVT PCDVSAQAAV IDG-AIVGAM TSINSELLGL THWTTTPNFY 

Por Resp C LSGDLLGFTN VSDGVIYSVT PCDVSAQAAI IDG-TIVGAI TSINSELLGL THWTTTPNFY 

OC43 SSEPALLFRN IKCNYVFNNS LTRQLQPINY FDS-YLGCW NAYNSTAISV QTCDLTVGSG 

BoCoV SSEPALLFRN IKCNYVFNNT LSRQLQPINY FDS-YLGCW NADNSTSSW QTCDLTVGSG 

MHV APEPALLYRN LKCDYVFNNN ISREETPLNY FDS-YLGCW NADNSTEEAV DACDLRMGSG 

Rat COV APEPALLYRN LKCDYVFNNN ISREETPLNY FDS-YLGCVI NADNSTEQSV DACDLRMGSG 

PHEV SSEPALMFRN LKCSHVFNNT ILRQIQLVNY FDS-YLGCW NAYNNTASAV STCDLTVGSG 

AIBV GSIDIFWQG EYGLNYYKVN PCEDVNQQFV VSGGKLVGIL TSRNETGSQL LENQFYIKIT 

SARS SSEVAVLYQD VNCTDVSTAI HADQLTPAWR IYS-TGNNVF QTQAGCLIGA EHVDTSYECD 
845

YVS NG 

YAS NG 

YHS ND 

YYSI YNY 

YYSI YNY 

YYSI YNY 

YYSI— YNY 

YCVD YSK 

YCVD YST 

LCVN — -YST 
LCVN — -YSI 

YCVD YVT 

NGTRRFRRSI 
IPIG AG I 
GNK

TYN 

GSN 



S65 

CTTAV 

CTDAV 

CTEPV 



TNDRTRGTAI DSNDVDCEPV 
TNVMNRGTAI D-NDIDCEPI 
TSERTRGTAI DSNDVDCEPV 
TNDKTRGTPI GSNDVDCEPV 

NRR SRGAI 

KRR SRRAI 

SHR — ARSSV 

AHR ARRSV 

ALR SRRSF 

TEN VANCPY 

CAS YHTVSL 
-ITANLSIPS
-VTANLSIPS 
-VTGNISIPT 
-STGNVTIPT 
-STGNVTIPT 
-STGNVTIPT 
-STGNVTIPT 
-GLYEIQIPS 
-GLYEIQIPS 
-GLYELQIPT 
-GLYEMQIPT 
-GLYEIQIPS 
NVTENVLIPN 
-SNNTIAIPT 
NWTTSVQVEY
NWTTSVQVEY 
NFSMSIRTEY 
NFTISVQVEY 
NFTISVQVEY 
NFTISVQVEY 
NFTISVQVEY 
EFTIGNMEEF 
EFTIGNMEEF 
NFTIASHQEF 
NFTIASHQEF 
EFTIGNLEEF 
SFNLTVTDEY 
NFSISITTEV 
LQITSTPIW
LQITSTPIW 
LQLYNTPVSV 
IQVYTTPVSI 
IQVYTTPVSI 
MQVYTTPVSI 
IQVYTTPVSI 
IQTSSPKVTI 
IQTSSPKVTI 
VQTRSPKVTI 
IQTRSPKVTI 
IQTRSPKVTI 
IQTRMDKVQI 
MPVSMAKTSV 
875
MIYSNFGICA 
LTYSSFGVCA 
LVYSNIGVCK 
ITYSNIGVCK 
ITYSNIGVCK 
ITYSNIGVCK 
ITYSNIGVCK 
TTGYRFTNFE 
TTGYRFTNFE 
STGYKLTTFE 
STGYKLTTFE 
TTGYRFTNFE 
VSYGKFCIKP 
LRSTSQKSIV 

I 
DCATYVCNGN
DCSTYVCNGN 
DCATYVCNGN 
DCSRYVCNGN 
DCARYVCNGN 
DCARYVCNGN 
DCSRYVCNGN 
DCAAFVCGDY 
DCSAFVCGDY 
DCAAFVCGGH 
DCAAFVCGDY 
DCATFVCGDY 
NCLQYVCGSS 
DCNMYICGDS 
DGSLIPVRPR
DGSIIAVQPR 
SGSIGYV-PS 
NGAFVFIN-V 
NGALVFIN-V 
NGALVFIN-V 
NGALVFIN-V 
PFTVNSVN-- 
PFTVNSVN— 
PFTVRIVN— 
PFTVSIVN— 
PFAANLVN-- 
DGSIATIVPK 
AYTMSLG— 
NSSDNGISAI
NVSYDSVSAI 
QYGQVKIAPT 
THSDGDVQPI 
THSDGDVQPI 
THSDGDVQPI 
THSDGDVQPI 

DSLEPVG 

DSLEPVG 

DSVESVD 

— DSVESVG 
— -DSIEPVG 
QLEQFVAPLF 
ADSSIAY 
PRCKNLLKQY
VRCVELLKQY 
SRCKQLLTQY 
PRCNKLLTQY 
PRCNKLLTQY 
PRCNKLLTQY 
PRCNKLLTQY 
AACKSQLVEY 
AACKSQLVEY 
TACRQQLVEY 
TACRQQLVDY 
AACRQQLAEY 
LDCRKLFQQY 
TECANLLLQY 
TSACKTIEDA
TSACKTIEDA 
TAACKTIESA 
VSACQTIEQA 
VSACQTIEQA 
VSACQTIEQA 
VSACQTIEQA 
GSFCDNINAI 
GSFCDNINAI 
GSFCDNINAI 
GSFCDNINAI 
GSFCENINAI 
GPVCDNILSV 
GSFCTQLNRA 
LRLSAHLETN
LRNSARLESA 
LQLSARLESV 
LAMGARLENM 
LAMGARLENM 
LAMGARLENM 
LAMGARLENM 
LTEVNELLDT 
LTEVNELLDT 
LGEVNNLIDT 
LGEVNNLIDT 
LTEVNELLDT 
VNSVGQKEDM 
LSGIAAEQDR 
DVSSMLTFDS
DVSEMLTFDK 
EVNSMLTISE 
EVDSMLFVSE 
EIDSMLFVSE 
EVDSMLFVSE 
EVDSMLFVSE 
TQLQVANSLM 
TQLQVANSLM 
MQLQVASALI 
MQLQVASALI 
TQLQVANSLM 
ELLNFYSSTK 
NTREVFAQVK 
RNIHSS

LPTSGS 

V — YDPASGR 
LKYILPSHNS 
LKDILPSHNS 
LKDILPSHNS 
LKYILPSDNS 
-SECSKASS- 
-SACNKVSS- 
-SDCGEVTMA 
-SDCSEGTKA 
-SECNRAST- 

PSSRR 

LKPTK- 
RIAGRSALED
RVAGRSAIED 
VVQKRSVIED 
KRKYRSAIED 
KRKYRSAIED 
KRKYGSAIED 
KRKYRSAIED 

RSAIED 

RSAIED 

AQTGRSAIED 
AQ-GRSAIED 

RSAIED 

K RSLIED 

RSFIED 
NA-FSLANVT
KA-FTLANVS 
EA-LQLATIS 
NA-LKLASVE 
NA-LKLASVE 
NA-LKLASVE 
NA-LKLASVE 
NG-VTLSTKL 
NG-VTLSTKL 
QG-VTLSSRL 
QG-VTLSSRL 
NG-VTLSTKI 
PAGFNTPVLS 
QM-YKTPTLK 

I 
IMVLPGVADA
IMVLPGVADA 
VMVLPGVVDA 
IMVLPGVANA 
IMVLPGVAND 
IMVLPGVANA 
IMVLPGVANA 
IKVLPPLLSE 
IKVLPPLLSV 
IKVLPPVLSE 
IKVLPPVLSE 
IKVLPPLLSE 
LLVLPPIITA 
LTVLPPLLTD 
ERMAMYTGSL
ERMAMYTGSL 
EKLHMYSASL 
DKMTMYTASL 
DKMTMYTASL 
DKMTMYTASL 
DKMTMYTASL 
NQISGYTLAA 
NQISGYTLAA 
NQISGYTAGA 
SQISGYTAGA 
NQISGYTLAA 
EMQALYTSSL 
DMIAAYTAAL 
LLFSKWTSG
ILFSKLVTSG 
LLFNKWTNG 
LLFDKWTSG 
LLFDKVVTSG 
LLFDKWTSG 
LLFSKWTSG 
LLFDKVKLSD 
LLFSKVKLSD 
VLFDKVKLSD 
VLFDKVKLSD 
LLFDKVKLSD 
LLFTSVESVG 
LLFNKVTLAD 



.... I I 

995 

SFG D 

SFG 0 

SFNG DG 

AFN— — SS 

A FN ST 

AFN ST 

AFN SS 

KDGVNFNVDD 
KDGVNFNVDD 
SDGIGGQIDD 
ADGISGQIDD 
KDGINFNVDD 

NVSTG E 

YFG G 



....|.. ..I ....|....| 
1005 1015 

YNLSSVLPQ 

YNLSSVIPS 

YNFTNVLGAS 

ETLDPIYKEW PNIGGSWLEG 
ENLDPIYKEW PNIGGSWLGG 
ENLDPIYKEW PSIGGSWLGG 
ETLDPIYKEW PNIGGFWLEG 
INFSPVLGCL G 



INFSPVLGCL 
INFSPLLGCL 
INFSPLLGCL 
INFSPVLGCL 
FNISLLLTN- 
FNFSQILPDP 



1055 
LGTVDVDYKS 
LGTVDADYKK 
LGTVDEDYKR 
LGTVDEDYKR 
LGTVDEDYKR 
LGTVDEDYKR 
LGTVDEDYKR 
VG-FVEAYNN 
VG-FVEAYNN 
VG-FVEAYNN 
VG-FVESYNN 
VG-FVQAYNN 
LP-TNDAYKN 
AG-FMKQYGE 



I I 

1065 
CTKGLS--IA 
CTKGLS — I A 
CSNGRS--VA 
CTGGYD — IA 
SAGGY D — I A 
CTGGYD— I A 
CTGGYD— IA 
CTGGAE— IR 
CTGGAE— I R 
CTGGQE — VR 
CTGGQE — VR 
CTGGAE— I R 
CTAGPLGFFK 
CLGDIN — AR 



1075 
DLACAQYYNG 
DLACAQYYNG 
DLVCAQYYSG 
DLVCAQYYNG 
DLVCARYYNG 
DLVCAQYYNG 
DLVCAQYYNG 
DLICVQSYKG 
DLICVQSYNG 
DLLCVQSFNG 
DLLCVQSFNG 
DLICVQSYNG 
DLACAREYNG 
DLICAQKFNG 



. ... I .... I 

1105 
IGGMVLGGLT 
IGGIALGGLT 
IGGMALGGIT 
AGGITLGALG 
TGGITLGALS 
AGGITLGALG 
AGGITLGALG 
TSASLFPLWT 
TSASLFPPLS 
TVSAMFP-WS 
TASAMFPPWS 
TAASLFPPWT 
VASMAFGGIT 
VSGTATAGWT 



....I. ...I 
. 1115 

S AAAIP 

S AVSIP 

A AAALP 

GG AVAIP 

GG— AVAIP 
GG— -AVAIP 
GG— -AVAIP 

A AAGVP 

A AVGVP 

A AAGVP 

A AAGVP 

A AAGVP 

A AGAIP 

FGAGAALQIP 



1125 
FSLALQARLN 
FSLAIQARLN 
FSYAVQARLN 
FAVAVQARLN 
FAVAVQARLN 
FAVAVQARLN 
FAVAVQARLN 
FYLNVQYRIN 
FYLNVQYRIN 
FSLSVQYRIN 
FALSVQYRIN 
FYLNVQYRIN 
FATQLQARIN 
FAMQMAYRFN 



....|. ...I 

1135 
YVALQTDVLQ 
YVALQTDVLQ 
YLALQTDVLQ 
YVALQTDVLN 
YVALQTDVLN 
YVALQTDVLN 
YVALQTDVLN 
GLGVTMDVLS 
GIGVTMDVLS 
GLGVTMNVLS 
GLGVTMNVLS 
GLGVTMDVLS 
HLGITQSLLL 
GIGVTQNVLY 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
1145 1155 1165 1175 1185 1195 

EMCR S ENQKILAASF NKAINNIVAS FSSVNDAITH TAEAIHTVTI ALNKIQDWN QQGSALNHLT 

229E S ENQKILAASF NKAMTNIVDA FTGVNDAITQ TSQALQTVAT ALNKIQDWN QQGNSLNHLT

PEDV RNQQLLAESF NSAIGNITSA FESVKEAISQ TSKGLNTVAH ALTKVQEWN SQGSALNQLT 

TGEV KNQQILASAF NQAIGNITQS FGKVNDAIHQ TSRGLATVAK ALAKVQDWN IQGQALSHLT 

CaCoV KNQQILANAF NQAIGNITQA FGKVNDAIHQ TSKGLATVAK ALAKVQDWN TQGQALSHLT 

FeCoV KNQQILANAF NQAIGNITQA FGKVNDAIHQ TSQGLATVAK ALAKVQDWN TQGQALSHLT 

Por Resp C KNQQILASAF NQAIGNITQS FGKVNDAIHQ TSRGLTTVAK ALAKVQDWN TQGQALRHLT 

OC43 QNQKLIANAF NNALYAIQEG FDATN S ALVKIQAWN ANAEALNNLL 

BoCoV QNQKLIANAF NNALDAIQEG FDATN S ALVKIQAWN ANAEALNNLL 

MHV ENQKMIASAF NNAIGAIQEG FAATN S ALAKMQFWN ANAEALNNLL

Rat CoV ENQKMIASSF NNAIGAIQEG FDATN S ALAKIQSWN ANAEALNNLL 

PHEV QNQKLIASAF NNALDAIQEG FDATN S ALVKIQAWN ANAEALNNLL 

AIBV KNQEKIAASF NKAIGHMQEG FRSTS L ALQQIQDWS KQSAILTETM 

SARS ENQKQIANQF NKAISQIQES LTTTS T ALGKLQDWN QNAQALNTLV 

....i.... i . ... i .... i — i — i 

1205 1215 1225 1235 1245 1255 

EMCR S SQLRHNFQAI SNSIHAIYDR LDSIQADQQV DRLITGRLAA LNAFVSQVLN KYTEVRGSRR 

229E S SQLRQNFQAI SSSIQAIYDR LDTIQADQQV DRLITGRLAA LNVFVSHTLT KYTEVRASRQ

PEDV VQLQHNFQAI SSSIDDIYSR LDILSADVQV DRLITGRLSA LNAFVAQTLT KYTEVQASRK 

TGEV VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

CaCoV VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

FeCoV VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

Por Resp C VQLQNNFQAI SSSISDIYNR LDELSADAQV DRLITGRLTA LNAFVSQTLT RQAEVRASRQ 

OC43 QQLSNRFGAI SASLQEILSR LDALEAEAQI DRLINGRLTA LNAYVSQQLS DSTLVKFSAA 

BoCoV QQLSNRFGAI SSSLQEILSR LDALEAQAQI DRLINGRLTA LNVYVSQQLS DSTLVKFSAA

MHV NQLSNRFGAI SASLQEILSR LDALEAQAQI DRLINGRLTA LNAYVSKQLS DMTLVKVSAA 

Rat CoV NQLSNRFGAI SASLQEILSR LDALEAQAQI DRLINGRLTA LNAYVSKQLS DMTLIKVSAA 

PHEV QQLSNRFGAI SASLQEILSR LDALEAKAQI DRLINGRLTA LNAYVSQQLS DSTLVKFSAA 

AIBV ASLNKNFGAI SSVIQEIYQQ FDAIQANAQV DRLITGRLSS LSVLASAKQA EYIRVSQQRE 

SARS KQLSSNFGAI SSVLNDILSR LDKVEAEVQI DRLITGRLQS LQTYVTQQLI RAAEIRASAN 



EMCR S 

229E S 

PEDV 

TGEV 

CaCoV 

FeCoV 

Por Resp C 
OC43 
BoCoV 
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



....|. ...| 

1265 
LAQQKINECV 
LAQQKVNECV 
LAQQKVNECV 
LAKDKVNECV 
LAKDKVNECV 
LAKDKVNECV 
LAKDKVNECV 
QAMEKVNECV 
QAMEKVNECV 
QAIEKVNECV 
QAIEKVNECV 
QAIEKVNECV 
LATQKINECV 
LAATKMSECV 



1275 
KSQSNRYGFC 
KSQSKRYGFC 
KSQSQRYGFC 
RSQSQRFGFC 
RSQSQRFGFC 
RSQSQRFGFC 
RSQSQRFGFC 
KSQSSRINFC 
KSQSSRINFC 
KSQSSRINFC 
KSQSPRINFC 
KSQSSRINFC 
KSQSIRYSFC 
LGQSKRVDFC 



....|. ...| 

1285 
G-NGTHIFSI 
G-NGTHIFSI 
GGDGEHIFSL 
G-NGTHLFSL 
G-NGTHLFSL 
G-NGTHLFSL 
G-NGTHLFSL 
G-NGNHIISL 
G-NGNHIISL 
G-NGNHILSL 
G-NGNHILSL 
G-NGNHIISL 
G-NGRHVLTI 
G-KGYHLMSF 



....! ....| 

1295 
VNSAPDGLLF 
VNAAPEGLVF 
VQAAPQGLLF 
ANAAPNGMIF 
ANAAPNGMIF 
ANAAPNGMIF 
ANAAPNGMIF 
VQNAPYGLYF 
VQNAPYGLYF 
VQNAPYGLYF 
VQNAPYGLYF 
VQNAPYGLYF 
PQNAPNGIVF 
PQAAPHGWF 



• t • 



1305 
LHTVLLPTDY 
LHTVLLPTQY 
LHTVLVPGDF 
FHTVLLPTAY 
FHTVLLPTAY 
FHTVLLPTAY 
FHTVLLPTAY 
IHFSYVPTKY 
IHFSYVPTKY 
IHFSYVPTSF 
IHFSYVPTSF 
IHFSYVPTKY 
IHFSYTPDSF 
LHVTYVPSQE 



I I 

1315 
KNVKAWSGIC 
KDVEAWSGLC 
VNVLAIAGLC 
ETVTAWPGIC 
ETVTAWSGIC 
ETVTAWSGIC 
ETVTAWSGIC 
VTARVSPGLC 
VTAKVSPGLC 
TTANVSPGLC 
TTVNVSPGLC 
VTAKVSPGLC 
VNVTAIVGFC 
RNFTTAPAIC 
TGEV
FeCoV
BoCoV
MHV 

Rat CoV 
PHEV 
AIBV 
SARS 



....I.. ..| 
1325 

VDG IYG 

VDG TNG 

VNG EIA 

ASDG-DRTFG 
ASDG-SRTFG 
ASDG-DRTFG 
ALDV-DRTFG 

I AG DRG 

I AG DRG 

ISG DRG 

ISG DRG 

I AG DIG 

VKPANASQYA 
HEG KA 



....|.. .-| 

1335 
YVLRQPNLVL 
YVLRQPNLAL 
LTLREPGLVL 
LVVKDVQLTL 
LVVEDVQLTL 
LWKDVQLTL 
LWKDVQLTL 
IAPKSGYFVN 
IAPKSGYFVN 
LAPKAGYFVQ 
LAPKAGYFVQ 
rSPKSGYFIN 
IVPANGRGIF 
YFPREGVFVF 



....|....| . . . . | | 

1345 1355 

YS DN GVFRVTSRVM 

YK ! EG NYYRITSRIM 

FTHELQTYTA TEYFVSSRRM 

FRN -"--LD DKFYLTPRTM 

FRN LD EKFYLTPRTM 

FRN LD DKFYLTPRTM 

FRN LD DKFYLTPRTM 



VN 

VN 

DD 

DH 

VN--- 
IQVN- 
NG— 



NTWMYTGSGY 
NTWMFTGSGY 
GEWKFTGSNY 
GEWKFTGSNY 
NSWMFTGSSY 
GSYYITAROM 
TSWFITQRNF 



....|....| 

1365 
FQPRLPVLSD 
FEPRIPTMAD 
FEPRKPTVSD 
YQPRVATSSD 
YQPRVATSSD 
YQPRVATSSD 
YQPRVATSSD 
YYPEPITENN 
YYPEPITGNN 
YYPEPITDKN 
YYPESITDKN 
YYPEPITQNN 
YMPRAITAGD 
FSPQIITTDN 



1375 
FVQIYNCNVT 
FVQIENCNVT 
FVQIESCWT 
FVQIEGCDVL 
FVQIEGCDVL 
FVQIEGCDVL 
FVQIEGCDVL 
WVMSTCAVN 
VWMSTCAVN 
SWMSSCAAN 
SWMSSCAVN 
WVMSTCAVN 
VVTLTSCQAN 
TFVSGNCDW 
1385
FVNISRVELH 
FVNISRSELQ 
YVNLTSDQLP 
FVNATVSDLP 
FVNGTVIELP 
FVNATVIDLP 
FVNTTVSDLP 
YTKAPYVMLN 
YTKAPDVMLN 
YTKAPEVFLN 
YTKAPEVFLN 
YTKAPDLMLN 
YVSVNKTVIT 
IGIINNTVYD 



....|....| 

1395 
TVIP-DYVDV 
TIVP-EYIDV 
DVIP-DYIDV 
SIIP-DYIDI 
SIIP-OYIDI 
SIIP-DYIDI 
SIIP-DYIDI 
TSIP-NLPDF 
ISTP-NLHDF 
TSIP-NLPDF 
TSIT-NLPDF 
TSTP-NLPDF 
TFVDNDDFDF 
PLQP-ELDSF 



....I. ...| 

1405 
NKTLQEFAQN 
NKTLQELSYK 
NKTLDEILAS 
NQTVQDILEN 
NQTVQDILEN 
NQTVQDILEN 
NQTVQDILEN 
KEELDQWFKN 
KEELDQWFKN 
KEELDKWFKN 
KEELDKWFKN 
KEELYQWFKN 
NDELSKWWND 
KEELDKYFKN 



.... I .... I 

1415 
L-PKYVKPNF 
L-PNYTVPDL 
L-PNRTGPSL 
FRPNWTVPEL 
FRPNWTVPEL 
YRPNWTVPEF 
FRPNWTVPEL 
QTSVAPDLSL 
QTSVAPDLSL 
QTSIAPDLSL 
QTSIVPDLSF 
QSSVAPDLSL 
T— KHELPDF 
HTSPDVDLGD 



... -I I 

1425 
DLTPFNLTYL 
WEQYNQTIL 
PLDVFNATYL 
TFDIFNATYL 
PLDI FHATYL 
TLDIFNATYL 
TLDVFNATYL 
DY — INVTFL 
DY — INVTFL 
DFEKLNVTLL 
DIGKLNVTFL 
DY— INVTFL 
DKFNYTVPIL 
ISG-INASW 



.... I I 

1435 
NLSSELKQLE 
NLTSEISTLE 
NLTGEIADLE 
NLTGEIDDLE 
NLTGEINDLE 
NLTGEIDDLE 
NLTGEIDDLE 

DLQVEMN 

DLQDEMN 

DLTDEMN 

DLSYEMN 

DLQDEMN 

DIDSEID 

NIQKEID 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

1445 1455 1465 1475 1485 1495 

EMCR S AKTASLFQTT VELQGLIDQI NSTYVDLKLL NRFENYIKWP WWVWLIISW FWLLSLLVF 

229E S NKSAELNYTV QKLQTLIDNI NSTLVDLKWL NRVETYIKWP WWVWLCISVV LIFWSMLLL 

PEDV QRSESLRNTT EELRSLINNI NNTLVDLEWL NRVETYIKWP WWVWLIIVIV LIFWSLLVF 

TGEV FRSEKLHNTT VELAILIDNI NNTLVNLEWL NRIETYVKWP WYVWLLIGLV VIFCIPLLLF

CaCoV FRSEKLHNTT VELAILIDNI NNTLVNLEWL NRIETYVKWP WYVWLLIGLV VIFCIPILLF

FeCoV FRSEKLHNTT VELAILIDNI NNTLVNLEWL NRIETYVKWP WYVWLLIGLV WFCIPLLLF

Por Resp C FRSEKLHNTT VELAILIDNI NNTWNLEWL NRIETYVKWP WYVWLLIGLV VIFCIPLLLF

OC43 RLQEAIKVL NQSYINLKDI GTYEYYVKWP WYVWLLICLA GVAMLVLLFF 

BoCoV RLQEAIKVL NQSYINLKDI GTYEYYVKWP WYVWLLIGFA GVAMLVLLFF 

MHV -RIQDAIKKL NESYINLKDV GTYEMYVKWP WYVWLLIGLA GVAVCVLLFF 

Rat CoV -RIQDAIKNL NESYINLKEI GTYEMYVKWP WYVWLLIGLA GVAVCVLLFF 

PHEV RLQEAIKVL NQSYINLKDI GTYEYYVKWP WYVWLLIGLA GVAMLVLLFF 

AIBV RIQGVIQGL NDSLIDLEKL SILKTYIKWP WYVWLAIAFA TIIFILILGW 

SARS RLNEVAKNL NESLIDLQEL GKYEQYIKWP WYVWLGFIAG LIAIVMVTIL 

....I. ...I ....I. ...| ....|....| ....|.. ..| ....|.... 
1505 1515 1525 1535 1545 

EMCR S CCLSTGCCGC CNCLTSSMRG CCDCGSTKLP YYEFEKVHVQ 

229E S CCCSTGCCGF FSCFASSIRG CCES — TKLP YYDVEKIHIQ 

PEDV CCISTGCCGC CGCCGACFSG CCRG-PRLQP YEAFEKVHVQ 

TGEV CCCSTGCCGC IGCLGSCCHS ICSR-RQFEN YEPIEKVHVH 

CaCoV CCCSTGCCGC IGCLGSCCHS ICSR-GQFES YEPIEKVHVH 

FeCoV CCFSTGCCGC IGCLGSCCHS ICSR-RQFEN YEPIEKVHVH 

Por Resp C CCCSTGCCGC IGCLGSCCHS IFSR-RQFEN YEPIEKVHVH 

OC43 ICCCTG-CG- -TSCFKKCGG CCDDYTGYQE LVIKT SH DD

BoCoV ICCCTG-CG- -TSCFKICGG CCDDYTGHQE LVIKT— SH DD 

MHV ICCCTG-CG- -SCCFKKCGN CCDECGGHQD SIVIHNISSH ED 

Rat CoV ICCCTG-CG- -SCCFKKCGN CCDEYGGRQA GIVIHNISSH ED 

PHEV ICCCTG-CG- -TSCFKKCGG CCDDYTGHQE FVIKT SH DD 

AIBV VFFMTGCCGC CCGCFGIMPL MSKCGKKSSY YTTFDNDWT EQYRPKKSV 

SARS LCCMTSCCS- -CLKGACSCG SCCKFDEDDS EPVLKGVKLH YT 



f. Putative Orf 4a 

....i. ...i 

5 15 25 35 45 55 

EMCR 4a MPFGGLFQLT LESTINKSVA NLKLPPHDVT VLRDNLKPVT TLSTITAYLL VSLFVTYFAL 

229E 4a MALG-LFTLQ LVSAVNQSLS NAKVSAEVSR QVIQDVKDGT VTFNLLAYTL MSLFWYFAL

....I. ...I | |" | | ....|. ...| ....|....| ....|....| 

65 75 85 95 105 115 

EMCR 4a FKPLTARGRV ACFVLKLLTL SVYVPLLVLF GMYLDSFIIF FLRCCFDSYM LAIMPISNKN 

229E 4a FKARSHRGRA ALIVFKILIL FVYVPLLYWS QAYIYATLIA VILLG-RFFH TAWHCWLYKT 

■ ... I .... I ....I. ...I ....|.... | I .... I ....|. ...I -...|.... I 

125 135 145 155 165 175 
EMCR 4a FSFVLFNVTK LCFVSGKCWY LEQSFYENRF AAIYGGDHYV VLGGETITFV SFDDLYVAIR 
229E 4a WDFIVFNVTT LCYAR 

,...|....| ....|....| ....|. ...| ....|. 
185 195 205 215 225 
EMCR 4a GSCEKNLQLM RKVDLYNGAV IYIFAEEPW GIVYSSQLYE DVPSIN 
229E 4a - 
g. Putative Orf4ab



EMCR 4a 
229E 4a 
229E 4b 



EMCR 4a 
229E 4a 
229E 4b 



EMCR 4a 
229E 4a 
229E 4b 



EMCR 4a 
229E 4a 
229E 4b 



....I.. ..| I | ....|.. ..I . 1 .... | ....|.. ..| 

5 15 25 35 45 55 

MPFGGLFQLT LESTINKSVA NLKLPPHDVT VLRDNLKPVT TLSTITAYLL VSLFVTYFAL 
MALG-LFTLQ LVSAVNQSLS NAKVSAEVSR QVIQDVKDGT VTFNLLAYTL MSLFWYFAL 

....|.... I ....|....| 1 .... I ....I.... I • . • . I I 

65 75 85 95 105 115 

FKPLTARGRV ACFVLKLLTL SVYVPLLVLF GMYLDSFIIF FLRCCFDSYM LAIMPISNKN 

FKARSHRGRA ALIVFKILIL FVYVPLLYWS QAYIYATLIA VILLG-RFFH TAWHCWLYKT

....|. ...| ....|....| ....|....| ....|.. ..| ....I. ...| ....|. ...| 

125 135 145 155 165 175 

FSFVLFNVTK LCFVSGKCWY LEQSFYENRF AAIYGGDHYV VLGGETITFV SFDDLYVAIR 

WDFIVFNVTT LCYAR 

MQGKCW FLENKALKPF VCFYGGDQFL YIGDRIVSYF STNDLYVALR 

....I. ...| ....|. ...I ....|....| ....|.... | ,...|. 

185 195 205 215 225 

GSCEKNLQLM RKVDLYNGAV IYIFAEEPW GIVYSSQLYE DVPSIN 

GRIDKDLSLS RKVELYNGEC VYLFCEHPAV GIVNTDFKLE IH 



h. Putative OrfE 



EMCR E 

229E 

PEDV 

TGEV 

CaCov 

FeCov 

Por Resp C 

OC43 

BoCoV 

PHEV 

MHV 

Rat CoV 

AIBV 

SARS 



....|.. ..I 
5 

MFLRLI 

MFLKLV 

MLQLV 

MTFPRALTVI 
MTFPRALTVI 
MTFPRAFTII 
MTFPRALTVI 
--MFMADAYL 
— MFMADAYF 
— MFMADAYL 

MFNLFL 

MFNLFL 

— MNLLNKSL 
MYSFVS 



....!.... I 
15 

DDNG-IVLNS 
DDHA-LVVNV 
NDNG-LVVNV 
DDNG-MVINI 
DDNG-MVISI 
DDHG-MVVSV 
DDNG-MVISI 
ADTV-WYVGQ 
ADTV-WYVGQ 
ADTV-WYVGQ 
TDTV-WYVGQ 
IDTV-WYVGQ 
EENG-SFLTA 
EETGTLIVNS 



,...|.... I 
25 

ILWLLVMIFF 
LLWCWLIVI 
ILWLFVLFFL 
IFWFLLIIIL 
IFWFLLIIIL 
FFWLLLIIIL 
IFWFLLIIIL 
IIFIVAICLL 
IIFIVAICLL 
IIFIVAICLL 
IIFIVAVCLM 
IIFIVAVCLM 
LYIIVGFLAL 
VLLFLAFWF 



......... I 

35 

F-VLAMTFIK 
L-LVCITIIK 
L-IISITFVQ 
I-LLSIALLN 
I-LFSIALLN 
I-LFSIALLN 
I-LLSIALLN 
VTIVWAFLA 
VIIVWAFLA 
VIIVWAFLA 
VTIIWAFLA 
VTIIWAFLA 
Y-LLGRALQA 
L-LVTLAILT 



....I I 

45 

LIQLCFTCHY 
LIKLCFTCHM 
LVNLCFTCHR 
IIKLCMVCCN 
IIKLCMVCCN 
VIKLCMVCCN 
IIKLCMVCCN 
TFKLCIQLCG 
TFKLCIQLCG 
TFKLCIQLCG 
SIKLCIQLCG 
SIKLCIQLCG 
FVQAADACCL 
ALRLCAYCCN 



....|....| 
55 

FFSRTLYQP- 
FCNRTVYGP- 
LCNSAVYTP- 
LGRTVIIVP- 
LGRTVIIVP- 
LGKTIIVLP- 
LGRTVIIVP- 
MCNTLVLSP- 
MCNTLVLSP- 
MCNTLVLSP- 
LCNTLLLSP- 
LCNTLLLSP- 
FWYTWVVIPG 
IVNVSLVKP- 
65

VYKIFL- 

IKNVYH- 

IGRLYR- 

AQHAYD- 

ARHAYD- 

ARHAYD- 

VQHAYD- 

SIYVFNR 

SIYVFNR 

SIYVFNR 

— SICVYNR 

SIYVYNR 

AKGTAFVYKY 
TVYVYS- 



-I.. 
75 



GR- 
GR- 
GR- 
SK- 
SK- 



■ • I I 

85 

--AYQDYM 
— IYQSYM 
— VYKSYM 
--AYKNFM 
— AYKNFM 
--AYKTFM 
—AYKNFM 
-QFYEFYN 
-QFYEFYN 
-QFYEFYN 
-QLYKYYN 
-QLYKYYN 



TYGRKLNNPE LEAVIVNEFP 
RVKNLN 



..| 

95 

— QIAPV-PA 
--HIDPF-PK 
— RIDPL-PS 
— RIKAYNPD 
— QIRAYNPD 
— QTKAYNPD 
— RIKAYNPD 
— DVKPP-VL 
— DVKPP-VL 
--DVKPP-VL 
E-EVRPP-PL 
E-EVRPP-PL 
KNGWNNKNPA 
— SSEGV-PD 



. ... I .... I 
105 

EVLNV 

RVIDF 

TVIDV 

GALLA 

EALLV 

EAFLV 

GALLV 

DVDDV 

DVDDV — 

DVDDV 

EVDDIIIQTL — 
EVDDIIIQTL — 
NFQOAQRDKL YS 
LLV 
i. Putative Orf M (Matrix protein)

...i — i — i ....i.. ,.i ....i. ...i ....i. ...| 

5 15 25 35 45 55 

EMCR M SNSS 

229E M SNDN 

PEDV M SNGS 

TGEV * MK ILL I LAC VI A CACGERYCAM KSDTDLSCRN 

CaCoV MKK ILFLLACAIA CVYGERYCAM TESS-TSCRN 

FeCoV MHMMPIRPLC KPRHIIPTKH FWFELNKMKY ILLILACIIA CVYGERYCAM QDSG-LQCIN 

PRCOV MK ILLILACAIA CTCGERYCAM KDDTGLSCRN 

OC43 M SSKT 

PHEV M SSPT 

BoCoV M SSVT 

MHV M TSTTQ 

RatSAV M SSTTP 

AIBV M PNETN 

SARS M ADNG 

...,|....| ....|. ...| ....|. ...| ....|....| . ... | .... I ....|....| 
65 75 85 95 105 115 

EMCR V PLSEVYVHLR NWNFSWNLIL TVFIWLQYG HYKYSRLLYG LKMSVLWCLW 

229E C -TGDIVTHLK NWNFGWNVIL TIFIVILQFG HYKYSRLFYG LKMLVLWLLW 

PEDV 1 PVDEVIEHLR NWNFTWNIIL TILLWLQYG HYKYSVFLYG VKMAILWILW 

TGEV STASDCESCF NGGDLIWHLA NWNFSWSIIL IVFITVLQYG RPQFSWFVYG IKMLIMWLLW 

CaCoV STAGNCASCF ETGDLIWHLA NWNFSWSVIL IIFITVLQYG RPQFSWFVCG IKMLIMWLLW 

FeCoV GTNSRCQTCF ERGDLIWHLA NWNFSWSVIL IVFITVLQYG RPQFSWLVYG IKMLIMWLLW

PRCOV GTASDCESCF NRGDLIWLLA NWNFSWSIIL IIFITVLQYG RPQFSWFVYG IKMLIMWLLW 

OC43 — TPAPVYIW TADEAIKFLK EWNFSLGIIL LFITIILQFG YTSRSMFVYV IKMIILWLMW 

PHEV — TPVPVISW TADEAIKFLK EWNFSLGIIV LFITIILQFG YTSRSMFVYV IKMVILWLMW 

BOCOV — TPAPVYTW TADEAIKFLK EWNFSLGIIL LFITIILQFG YTSRSMFVYV IKMIILWLMW 

MHV — APQPVYQW TADEAIRFLK EWNFSLGIIL LFVTIILQFG YTSRSMFVYV VKMILLWLMW 

RatSAV — APQTVYQW TADVAVRFLK EWNFLLGIIL LFITIILQFG YTSRSMFIYV VKMIILWLMW 

AIBV CTL DFEQSVQLFK EYNLFITAFL LFLTIILQYG YATRSKVIYT LKMIVLWCFW 

SARS TI TVEELKQLLE QWNLVIGFLF LAWIMLLQFA YSNRNRFLYI I KLVFLWLLW 

....I. ...| ....|. ...| ....|.. ..| ....|. ...| . ... I .... I 
125 135 145 155 165 175 

EMCR PLVLALSIFD CFVNFNVD-W VFFGFSILMS IITLCLWVMY FVNSFRLWRR VKTFWAFNPE 

229E PLVLALSIFD TWANWDSN-W AFVAFSFFMA VSTLVMWVMY FANSFRLFRR ARTFWAWNPE

PEDV PLVLALSLFD AWASFQVN-W VFFAFSILMA CITLMLWIMY FVNSIRLWRR THSWWSFNPE 

TGEV PVVLALTIFN AYSEYQVSRY VMFGFSIAGA IVTFVLWIMY FVRSIQLYRR TKSWWSFNPE

CaCoV PIVLALTIFN AYLEYRVSRY VMFGFSVAGA TVTFILWIMY FVRSIQLYRR TKSWWSFNPE 

FeCoV PIVLALTIFN AYSEYQVSRY VMFGFSVAGA WTFALWMMY FVRSVQLYRR TKSWWSFNPE 

PRCoV PIVLALTIFN AYSEYQVSRY VMFGFSIAGA IVTFVLWIMY FVRSIQLYRR TKSWWSFNPE 

OC43 PLTIILTIFN — CVYALN-N VYLGLSIVFT IVAIIMWIVY FVNSIRLFIR TGSFWSFNPE 

PHEV PLTIILTIFN — CVYALN-N VYLGFSIVFT IVAIIMWWY FVNSIRLFIR TGSWWSFNPE 

BOCOV PLTIILTIFN —CVYALN-N VYLGFSIVFT IVAIIMWIVY FVNSIRLFIR TGSWWSFNPE 

MHV PLTIVLCIFN —CVYALN-N VYLGFSIVFT IVSIIMWIMY FVNSIRLFIR TGSWWSFNPE 

RatSAV PLTIVLCIFN --CVYALN-N VYLGFSIVFT IVSIVMWIMY FVNSIRLFIR TGSWWSFNPE 

AIBV PLNIAVGVIS — CTYPPN-T GGLVAAIILT VFACLSFVGY WIQSIRLFKR CRSWWSFNPE 

SARS PVTLACFVLA — AVYRIN-W VTGGIAIAMA CIVGLMWLSY FVASFRLFAR TRSMWSFNPE 

• ... I .... I ....I.. ..I ....|....| ....|. ...| ....|....| \ I 

185 195 205 215 225 235 

EMCR TNAIISLQVY -GHNYYLPVM AAPTGVTLTL LSGVLLVDGH KIATRVQVGQ LPKYVIVATP 

229E VNAITVTTVL -GQTYYQPIQ QAPTGITVTL LSGVLYVDGH RLASGVQVHN LPEYMTVAVP

PEDV TDALLTTSVM -GRQVCIPVL GAPTGVTLTL LSGTLLVEGY KVATGVQVSQ LPNFVTVAKA 

TGEV TKAILCVSAL -GRSYVLPLE GVPTGVTLTL LSGNLYAEGF KIAGGMNIDN LPKYVMVALP 

CaCoV TSAILCVSAL -GRSYVLPLE GVPTGVTLTL LSGNLCAEGF KIAGGMNIDN LPKYVMVALP 

FeCoV TNAILCVNAL -GRSYVLPLD GTPTGVTLTL LSGNLYAEGF KMAGGLTIEH LPKYVMIATP 

PRCoV TNAILCVSAL -GRSYVLPLE GVPTGVTLTL LSGNLYAEGF KIAGGMTIDN LPKYVMVALP 

OC43 TNNLMCIDMK -GTMYVRPII EDYHTLTVTI IRGHLYIQGI KLGTGYSLAD LPAYMTVAK- 

PHEV TNNLMCIDMK -GRMYVRPII EDYHTLTATI IRGHLYIQGI KLGTGYSLSD LPAYVTVAK- 

BOCOV TNNLMCIDMK -GRMYVRPII EDYHTLTVTI IRGHLYMQGI KLGTGYSLSD LPAYVTVAK- 

MHV TNNLMCIDMK -GTVYVRPII EDYHTLTATI IRGHLYMQGV KLGTGFSLSD LPAYVTVAK- 

RatSAV TNNLMCIDVK -GTVYVRPII EDYHTLTATN VRGHLYMQGV KLGTGFSLSD LPAYVTVAK- 

AIBV SNAVGSILLT NGQQCNFAIE SVPMVLSPII KNGVLYCEGQ WLAK-CEPDH LPKDIFVCTP 

SARS TNILLNVPLR -GTIVTRPLM ESELVIGAVI IRGHLRMAGH PLGR-CDIKD LPKEITVAT- 
....|....| ....|....| ....|....| ....|....| ....|....| ..
245 255 265 275 285

EMCR STTIVCDRVG RSVNETSQTG WAFYVRAKHG DFSGVASQEG VLSEREKLLH LI 

229E STTIIYSRVG RSVNSQNSTG WVFYVRVKHG DFSAVSSPMS NMTENERLLH FF 

PEDV TTTIVYGRVG RSVNASSGTG WAFYVRSKHG DYSAVSNPSA VLTDSEKVLH LV 

TGEV SRTIVYTLVG KKLKASSATG WAYYVKSKAG DYSTEAR-TD NLSEQEKLLH MV 

CaCoV VRTIVYTLVG KKLKASSATG WAYYVKSKAG DYSTDAR-TD NLSEHEKLLH MV 

FeCoV SRTIVYTLVG KQLKATTATG WAYYVKSKAG DYSTEAR-TD NLSEHEKLLH MV 

PRCOV SRTIVYTLVG KKLKASSATG WAYYVKSKAG DYSTEAR-TD NLSEQEKLLH MV 

OC43 VTHLCTYKRG FLDRISDTSG FAVYVKSKVG NYRLPSTQKG SGMDTALLRN NI 

PHEV VTHLCTYKRG FLDRIGDTSG FAVYVKSKVG NYRLPSTHKG SGMDTALLRN NI 

BoCoV VSHLLTYKRG FLDKIGDTSG FAVYVKSKVG NYRLPSTQKG SGMDTALLRN NI 

MHV VSHLCTYKRA FLDKVDGVSG FAVYVKSKVG NYRLPSN-KP SGMDTALLR- -I

RatSAV VSHLCTYKRA FLDKVDGVSG FAVYVKSKVG NYRLPSN-KP SGADTALLR- -I 

AIBV DRRNIYRMVQ KYTGDQSGNK KRFATFVYAK QSVDTGELES VATGGSSLYT — 

SARS SRTLSYYKLG ASQRVGTDSG FAAYNRYRIG NYKLNTDHAG SNDNIALLVQ — 



j. Putative Orf N (Nucleoprotein)

5 15 25 35 45 55 

EMCR MAS VN W ADDR AARKKF 

229E MAT- — VK— W ADASEPQ RGRQGR 

PEDV MAS VS— F QDRG RKR 

TGEV MANQGQR VS— W GDESTKT RGRSNSRG RKNNN 

FeCoV MATQGQR- VN W GDEPSKR RGRSNSRG RKNND 

PRCOV MANQGQR VS— W GDESTKI RGRSNSRG RKINN 

CaCoV MASQGQR VS — -W GDESTKR RGRSNSRG RKNND 

RSDACoV MSFVPGQENA GSRSSSGNRA GNGILKKTTW ADQTERGQNN GNRGRRNQPK QTATTQ-PNT 

MHV MSFVPGQENA GSRSSSGNRA GNGILKKTTW ADQTERG NRGRRNHPK QTATTQ-PNA 

PHEV MSFTPGKQSS -SRASSGNRS GNGILK W ADQSDQSRNV QTRGRRVQSK QTATSQQPSG 

OC43 MSFTPGKQSS -SRASSGNRS GNGILK— W ADQSDQFRNV QTRGRRAQPK QTATSQQPSG 

BoCoV MSFTPGKQSS -SRASFGNRS GNGILK— W ADQSDQSRNV QTRGRRAQPK QTATSQLPSG 

SARS MSDNGPQS NQRSAPRITF GGPTDSTDNN QNGGRNGARP KQRRPQ 

AIBV MASG K A AGKTDAPAPV IKLGGPKPPK VGSS 

....|....| I I ....|....| ...,|. ...| ....|.. ..| ....|....| 

65 75 85 95 105 115 

EMCR PPPSFY MPLLVSSDKA PYRVIPRNLV PIGKGNK-DE QIGYWNVQER — WRMRRGQR 

229E IPYSLY SPLLVDSE-Q PWKVIPRNLV PINKKDK-NK LIGYWNVQKR — FRTRKGKR 

PEDV VPLSLY APLRVTNDKP LSKVLANNAV PTNKGNK-DQ QIGYWNEQIR — WRMRRGER 

TGEV IPLSFF NPITLQQGSK FWNLCPRDFV PKGIGNR-DQ QIGYWNRQTR — YRMVKGQR 

FeCoV IPLSFY NPITLEQGSK FWNLCPRDLV PKGIGNK-DQ QIGYWNRQIR — YRIVKGQR 

PRCOV IPLSFF NPITLQQGAK FWNSCPRDFV PKGIGNR-DQ QIGYWNRQTR —YRMVKGQR 

CaCoV IPLSFF NPITLEQGSK FWDLCPRDFV PKGIGNK-DQ QIGYWNRQTR — YRMVKGRR 

RSDACOV GSVVPHYSWF SGITQFQKGK EFQFAGGQGV PIANGIPPSE QKGYWYRHNR RSFKTPDGQQ 
MHV GSVVPHYSWF SGITQFQKGK EFQFAQGQGV PIASGIPASE QKGYWYRHNR RSFKTPDGQH

PHEV GTVVPYYSWF SGITQFQKGK EFEFAEGQGV PIAPGVPSTE AKGYWYRHNR RSFKTADGNQ 

OC43 GNWPYYSWF SGITQFQKGK EFEFVEGQGV PIAPGVPATE AKGYWYRHNR RSFKTADGNQ 

BoCoV GNWPYYSWF SGITQFQKGK EFEFAEGQGV PIAPGVPATE AKGYWYRHNR RSFKTADGNQ 

SARS GLPNNTASWF TALTQHGK-E ELRFPRGQGV PINTNSGPDD QIGYYRRATR R-VRGGDGKM 

AIBV GNASWF QAIKAKKLNT PPPKFEGSGV PDNENIKPSQ QHGYWRRQAR — FKPGKGGR

....|.... I I I ....|....| ....|. ...| ....|.. ..| ....|.. ..| 

125 135 145 155 165 175 

EMCR VDLPPKVHFY YLGTGPHKDL KFRQRSDGVV WVAKEGAKTV NTSLGNRK— RNQKPLEPKF 

229E VDLSPKLHFY YLGTGPHKDA KFRERVEGW WVAVDGAKTE PTGYGVRR— KNSEPEIPHF

PEDV IEQPSNWHFY YLGTGPHGDL RYRTRTEGVF WVAKEGAKTE PTNLGVRK— ASEKPIIPKF 

TGEV KELPERWFFY YLGTGPHADA KFKDKLDGW WVAKDGAMNK PTTLGSRG— ANNESKALKF 

FeCoV KELAERWFFY FLGTGPHADA KFKDKIDGVF WVARDGAMNK PTTLGTRG— TNNESKPLRF 

PRCoV KELPERWFFY YLGTGPHADA KFKDKLDGW WVAKDGAMNK PTTLGSRG-- ANNESKALKF 

CaCoV KNLPEKWFFY YLGTGPHADA KFKQKLDGVV WVARGDSMTK PTTLGTRG-- TNNESKALKF 

RSDACoV KQLLPRWYFY YLGTGPHAGA SFGDSIEGVF WVANSQADTN TSADIVERDP SSHEAIPTRF 

MHV KQLLPRWYFY YLGTGPHAGA EYGDDIEGVV WVASQQADTK TTADWERDP SSHEAIPTRF 

PHEV RQLLPRWYFY YLGTGPHAKD QYGTDIDGVF WVASNQADIN TPADIVDRDP SSDEAIPTRF 

OC43 RQLLPRWYFY YLGTGPHAKD QYGTDIDGVY WVASNQADVN TPADIVDRDP SSDEAIPTRF 

BOCOV RQLLPRWYFY YLGTGPHAKD QYGTDIDGVF WVASNQADVN TPADILDRDP SSDEAIPTRF 

SARS KELSPRWYFY YLGTGPEASL PYGANKEGIV WVATEGALNT PKDHIGTRNP NNNAATVLQL 

AIBV KPVPDAWYFY YTGTGPAADL NWGDTQDGIV WVAAKGADTK SRSNQGTRDP DKFDQYPLRF 
....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

185 195 205 215 225 235 

EMCR SIALPPELSV VEFEDRSNNS SRASSRSSTR NNSR DS 

229E NQKLPNGVTV VEEPD SRAPSRSQSR SQSR

PEDV SQQLPSVVEI VEPNTPP— A SRANSRSRSR GNGNNRSRSP SNNRGNNQSR GNSQNRGNNQ 

TGEV DGKVPGEFQL EVNQS RONSRSRSQ 

FeCoV DGKIPPQFQL EVNRS RNNSRSGSQ - 

PRCoV DGKVPGEFQL EVNQS RONSRSRSQ 

CaCoV DVKVPSEFHL EVNQL RDNSRSRSQ 

RSDACoV APGTVLPQGF YVEGS GRSAPASRSG 

MHV APGTVLPQGF YVEGS GRSAPASRSG 

PHEV PPGTVLPQGY YIEGS GRSAPNSRST 

OC43 PPGTVLPQGY YIEGS GRSAPNSRST 

BOCOV PPGTVLPQGY YIEGS GRSAPNSRST 

SARS PQGTTLPKGF YAEGS RGGSQASSR 

AIBV SDGGPDGNFR WDFIP LN-RGRSG 





245 




255 


265 


275 


285 


295 


EMCR 


SRSTSRQ 




QSR- 


TRSDSNQS — 


S-SDL 


VAAVTLALKN 


LGFDN — QSK 


229E 


GRGESKP 




QSRN 


PSSDRNHN — 


SQDDI 


MKAVAAALKS 


LGFDKP-QEK 


PEDV 


GRGASQNRGG NNNNNNKSRN


QSNNRNQSND 


RGGVTSRDDL 


VAAVKDALKS 


LGIGEN-PDR 


TGEV 








RQQFNNKK— 


DDSV 


EQAVLAALKK 


LGVDTE-KQQ 


FeCoV 


SRSVSRNR-- 




SQSRG 


RHHSNNQ 


NNNV 


EDTIVAVLEK 


LGV-TD-KQ- 


PRCoV 








RQQSNNKK — 


DDSV 


EQAVLAALKK 


LGVYTE-KQQ 


CaCoV 


SRSQSRNR— 




SQSRG 


RQLSNNKK — 


DDNV 


EQAVLAALKK 


LGVDTE-KQQ 


RSDACoV 


SRSQSRGP— 




NNRA 


RSSSNQRQ-- 


PASTV 


KPDMAEEIAA 


LVLAN LG 


MHV 


SRSQSRGP— 




NNRA 


RSSSNQRQ — 




KPDMAEEIAA 


LVLAK LG 


PHEV 


SRAPNRAPS- 




AGSRS 


RANSGNRT — 


STPGV 


TPDMADQIAS 


LVLAK LG 


OC43 


SRTSSRASS- 




AGSRS 




PTSGV 


TPDMADQIAS 


LVLAK LG 


BoCoV 


SRASSRASS- 




AGSRS 


RANSGNRT — 


PTSGV 


TPDMADQIAS 


LVLAK LG 


SARS 


SSSRSRGN— 




SRNST 


PGSSRGNS-- 


PARMA 


SGGGETALAL 


LLLDRLNQLE 


AIBV 








PSREGSRG — 


RRSDS 


GDDLIARAAK 






305 315 325 335 345 355

EMCR SPSSSGTSTP K- K- PNKPLSQ PRADKPS -QLKKPRWKR VPTR--EENV

229E DKKSAKTGTP KPSRNQSPAS SQTSAKSLAR SQSSETKEQK HEMQKPRWKR QPNDDVTSNV

PEDV HKQQQKPKQE K- SDN SG KNTPKKNKSR ATSKERD -LKDIPEWRR IPKG--ENSV

TGEV QRSRSKSKER DTTPKNE NKHTWKR TAGK GDV

FeCoV -RSRSKPRER DTTPKNA NKHTWKK TAGK-- GDV

PRCoV QRSRSKSKER DTTPKNE NKHTWKR TAGK GDV

CaCoV -RSRSKSKER DTTPKNE NKHTWKR TAGK GDV

RSDACoV -KDAGQPKQV T- KQSAK EVRQKIL NKPRQKR TPNK--QCPV

MHV -KDAGQPKQV EVRQKIL TKPRQKR TPNK--QCPV

PHEV -KDATKPQQV T- KQTAK EVRQKIL NKPRQKR SPNK--QCTV

OC43 -KDATKPQQV T- KHTAK EVRQKIL-- ---NKPRQKR SPNK--QCTV

BoCoV -KDATKPQQV KQTAK EIRQKIL --NKPRQKR SPNK--QCTV

SARS SKVSGKGQQQ Q- KSAAEAS --KKPRQKR TATK--QYNV

AIBV QKKGSRI T- KAKAD EMAHRRY CKRT IPPN YRV




365 375 385 395 405 415

EMCR IQCFGPRDFN H---NMGDSD LVQNGVDAKG FPQLAELIPN QAALFFDSEV STDEVG

229E TQCFGPRDLO H---NFGSAG WANGVKAKG YPQFAELVPS TAAMLFDSHI VSKESG

PEDV AACFGPRGGF K---NFGDAE FVEKGVDASG YAQIASLAPN VAALLFGGNV AVRELA

TGEV TRFYGARSSS A---NFGDTD LVANGSSAKH YPQLAECVPS VSSILFGSYW TSKEDG

FeCoV TTFYGARSSS A---NFGDSD LVANGNAAKC YPQIAECVPS VSSIIFGSQW SAEEAG

PRCoV TRFYGARSSS A---NFGDSD LVANGSSAKH YPQLAECVPS VSSILFGSYW TSKEDG

CaCoV TKFYGARSSS A---NFGDSD LVANGNGAKH YPQLAECVPS VSSILFGSHW TAKEDG

RSDACoV QQCFGKRGPN Q---NFGGPE MLKLGTSDPQ FPILAELAPT PGAFFFGSKL ELVKKN--SG

MHV QQCFGKRGPN Q---NFGGSE MLKLGTSDPQ FPILAELAPT PSAFFFGSKL ELVKKN--SG

PHEV QQCFGKRGPN Q---NFGGGE MLKLGTSDPQ FPILAELAPT AGAFFFGSRL ELAKVQNLSG

OC43 QQCFGKRGPN Q---NFGGGE MLKLGTSDPQ FPILAELAPT AGAFFFGSRL ELAKVQNLSG

BoCoV QQCFGKRGPN Q---NFGGGE MLKLGTSDPQ FPILAELAPT AGAFFFGSRL ELAKVQNLSG

SARS TQAFGRRGPE QTQGNFGDQD LIRQGTDYKH WPQIAQFAPS ASAFFGMSRI GMEVTP

AIBV DQVFGPRTKG K-EGNFGDDK MNEEGIKDGR VTAMLNLVPS SHACLFGSRV TPKLQL



....|....| ....|....| ....|....| ....|....| ....|....| ....|....|

425 435 445 455 465 475

EMCR DNV QITYT Y KMLVAKDNKN LPKFIEQISA FTKPS SIKEMQSQSS

229E NTV VLTFT T RVTVPKDHPH LGKFLEELNA EMQQHPLLNP

PEDV DSY EITYN Y KMTVPKSDPN VELLVSQVDA AKLQRKKEKK

TGEV DQI EVTFT H KYHLPKDDPK TGQFLQQINA YARPS EVAKEQRKRK

FeCoV KVTLT H TYYLPKDDAK TSQFLEQIDA YKRPS EVAKDQRQRR

PRCoV DQI EVTFT H KYHLPKDHPK TEQFLQQINA YASPS ELAKEQRKRK

CaCoV DQI EVTFT H KYHLPKDDPK TGQFLQQINA YARPS EVAKEQRQRK

RSDACoV GVDEPTKDVY ELQYSGAVRF DSTLPGFETI MKVLNENLNA YQNQA GGADVVSPKP

MHV GADEPTKDVY ELQYSGAIRF DSTLPGFETI MKVLTENLNA YQDQA GSVDLVSPKP

PHEV NPDEPQKDVY ELRYNGAIRF DSTLSGFETI MKVLNQNLNA YQHQE DGMMNISPKP

OC43 NPDEPQKDVY ELRYNGAIRF DSTLSGFETI MKVLNENLNA YQQQ DGMMNMSPKP

BoCoV NLDEPQKDVY ELRYNGAIRF DSTLSGFETI MKVLNENLNA YQQQ DGMMNMSPKP

SARS ---SGT WLTYHGAIKL DDKDPQFKDN VILLNKHIDA YKTFP PTEPKKDKKK

AIBV HLRFEFTTW PCDDPQFDNY VKICDQCVDG VGTRPKDDEP KPKSRSSSRP
485 495 505 515 525 535

EMCR HVAQNTVLN AS I PES KPLADDDSA IIEIVNEVLH 

229E SALEFNPSO TSPATA EPVRDEVSI ETDIIDEVN 

PEDV NKRETTLQQH EEAIYDDVGA PS DVT HAN LE WDTAVDGGDT AVEIINEIFD TGN 

TGEV SRSKSAERS EQDWPDALI ENYTDVFDDT QVEIIDEVTN 

FeCoV SRSKSADK KPEELSVTLV EAYTDVFDDT QVEMIDEVTN

PRCoV SRSKSAERS EQEWPDSLI ENYTDVFDDT QVEMIDEVTN

CaCoV ARSKSVERV EQEVVPDALT ENYTDVFDDT QVEIIDEVTN 

RSDACoV QRKRGTKQT- -AQKEELDSI SVAKPKSAVQ RNVSRELTPE DRSLLAQILD DGWPDGLDD 

MHV PRRGRRQAQ- -EKKDEVDNV SVAKPKSLVQ RNVSRELTPE DRSLLAQILD DGWPDGLED 

PHEV QRQRGQKN GQVENDNV SVAAPKSRVQ QNKSRELTAE DISLLKKMDE P YTED 

OC43 QRQRGHKN GQGENDNI SVAVPKSRVQ QNKSRELTAE DISLLKKMDE P YTED 

BoCoV QRQRGQKN GQGENDNI SVAAPKSRVQ QNKSRELTAE DISLLKKMDE P YTED

SARS KTDEAQPLP QRQKKQPTVT LLPAADMDDF SRQLQNSMSG ASADSTQA--

AIBV ATRGNS PAPR QQRPKKEKKL KKQDDEADKA LTSDEERNNA QLEFYDEPKV INWGDAALGE 



EMCR 

229E 

PEDV 

TGEV 

FeCoV 

PRCoV

CaCoV

RSDACoV -SNV

MHV DSNV

PHEV TSEI

OC43 TSEI

BoCoV TSEI 

SARS 

AIBV NEL- 



k. 5' untranslated region (genomic sequence)



          ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
              5         15         25         35         45         55
EMCR5'UTR                          AGATAGA GAATTTTCTT ATTTAGACTT TGTGTCTACT
229E5'UTR ACTTAAGTAC CTTATCTATC TACAGATAGA AAAGTTGCTT -TTTAGACTT TGTGTCTACT

          ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
             65         75         85         95        105        115
EMCR5'UTR CCTCTCAACT AAACGAAATT TTT-CTAGTG CTGTCATTTG TTATG--GCA GTCCTAGTGT
229E5'UTR TTTCTCAACT AAACGAAATT TTTGCTATGG CCGGCATCTT TGATGCTGGA GTCGTAGTGT

          ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
            125        135        145        155        165        175
EMCR5'UTR AATTGAAATT TCGTCAAGTT TGTAA-ACTG GTTAGGCAAG TGTTGTATTT TCTGTGTTTA
229E5'UTR AATTGAAATT TCATTTGGGT TGCAACAGTT TGGAAGCAAG TGCTGTGTGT CCTA-GTCTA

          ....|....| ....|....| ....|....| ....|....| ....|....| ....|....|
            185        195        205        215        225        235
EMCR5'UTR AGCACTGGTG GTTCTGTC-C ACTAGTGCAC AC-ATTGATA CTTAAGT-GG TGTTCTGTCA
229E5'UTR AGGGTTTCGT GTTCCGTCAC GAGATTCCAT TCTACAAACG CCTTACTCGA GGTTCCGTCT

          ....|....| ....|....| ....|....| ....|....| ....|....| ....
            245        255        265        275        285
EMCR5'UTR CTGCTTATTG TGGAAGCAAC GTTCTGTCGT TGTGGAAACC AATAACTGCT AACC
229E5'UTR CGTGTTTGTG TGGAAGCAAA GTTCTGTCTT TGTGGAAACC AGTAACTGTT CCTA